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METHODS TO MEDIATE POLYKETIDE SYNTHASE 
MODULE EFFECTIVENESS 

Cross-Reference to Related Applications 

[0001] This patent application is a continuation-in-part application of copending U.S. 
patent application 09/500,747, filed 9 February 2000; which claims the benefit of the 
filing date of U.S. Provisional Application No. 60/119,363, filed 9 February 1999. 
Further, this application claims the benefit of the filing date of U.S. Provisional 
Application Nos. 60/272,985 and 60/272,987, both filed 2 March 2001. Each of these 
applications is incorporated herein by reference. 

Statement of Rights to Inventions Made Under Federally Sponsored Research 
[0002] The invention herein was made, at least in part, based on support by grants 
CA-66736, GM-22172, and GM-22176 from the National Institutes of Health, and grant 
BES-9806774 from the National Science Foundation. The U.S. government may have 
certain rights in the invention. 

Technical Field 

[0003] The invention is directed to facilitating usage by polyketide synthase modules 
of nascent polyketide chains. Specifically, the invention concerns including intermodule 
and intramodule linkers in constructions for synthesis of desired polyketides. More 
specifically, the invention concerns the effects of protein-protein interactions and 
enzyme-substrate interactions in the channeling of intermediates between polyketide 
synthase modules. 

Background of the Invention 

[0004] Polyketides are a class of compounds synthesized from 2-carbon units through 
a series of condensations and subsequent modifications. Polyketides occur in many types 
of organisms, including fungi and mycelial bacteria, in particular, the actinomycetes. 
Polyketides are biologically active molecules with a wide variety of structures, and the 
class encompasses numerous compounds with diverse activities. Tetracycline, 
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erythromycin, epothilone, FK-506, FK-520, narbomycin, picromycin, rapamycin, 
spinocyn, and tylosin are examples of polyketides. Given the difficulty in producing 
polyketide compounds by traditional chemical methodology, and the typically low 
production of polyketides in wild-type cells, there has been considerable interest in 
finding improved or alternate means to produce polyketide compounds. 
[0005] The biosynthetic diversity of polyketides is generated by repetitive 
condensations of simple monomers by polyketide synthase (PKS) enzymes that mimic 
fatty acid synthases. For instance, the deoxyerythronolide-B synthase catalyzes the chain 
extension of a primer with several methylmalonyl coenzyme A (MeMalCoA) extender 
units to produce the erythromycin core. 

[0006] The cloning, analysis, and recombinant DNA technology of genes that encode 
PKS enzymes allows one to manipulate a known PKS gene cluster either to produce the 
polyketide synthesized by that PKS at higher levels than occur in nature or in hosts that 
otherwise do not produce the polyketide. The technology also allows one to produce 
molecules that are structurally related to, but distinct from, the polyketides produced from 
known PKS gene clusters. See, e.g., PCT publication Nos. WO 93/13663; 95/08548; 
96/40968; 97/02358; 98/27203; and 98/49315; United States Patent Nos. 4,874,748; 
5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 5,830,750; and 5,843,718; and 
Fu, et aL 9 1994, Biochemistry 33: 9321-9326; McDaniel, et aL, 1993, Science 262: 
1546-1550; andRohr, 1995, Angew. Chem. Int. Ed. Engl 34(8): 881-888, each of which 
is incorporated herein by reference. 

[0007] PKSs catalyze the biosynthesis of polyketides through repeated, 
decarboxylative Claisen condensations between acylthioester building blocks. The 
building blocks used to form complex polyketides are typically acylthioesters, such as 
acetyl, butyryl, propionyl, malonyl, hydroxymalonyl, methylmalonyl, and ethylmalonyl 
CoA. Two major types of PKS enzymes are known; these differ in their composition and 
mode of synthesis of the polyketide synthesized. These two major types of PKS enzymes 
are commonly referred to as Type I or "modular" and Type II "iterative" PKS enzymes. 
[0008] The present invention concerns modular PKS. In the Type I or modular PKS 
enzyme group, a set of separate catalytic active sites (each active site is termed a 
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"domain", and a set thereof is termed a "module") exists for each cycle of carbon chain 
elongation and modification in the polyketide synthesis pathway. The typical modular 
PKS is composed of several large polypeptides, which can be segregated from amino to 
carboxy terminii into a loading module, multiple extender modules, and a releasing (or 
5 thioesterase) domain. The PKS enzyme known as 6-deoxyerythronolide B synthase 

(DEBS) is a typical Type I PKS. In DEBS, there is a loading module, six extender 
modules, and a thioesterase (TE) domain. The loading module, six extender modules, 
and TE of DEBS are present on three separate proteins (designated DEBS-1, DEBS-2, 
and DEBS-3, with two extender modules per protein). Each of the DEBS polypeptides is 
10 encoded by a separate open reading frame (ORF) or gene; these genes are known as 

Q eryAI, eryAII, and eryAIIL See Figure 1 . There is considerable interest in the genetic and 

£T chemical reprogramming of modular PKSs (see, e.g., Khosla, 1997, Chem. Rev. 97:2577- 

W 2590, and Staunton, etaL, 1997, Chem. Rev. 2611-2629, each of which is incorporated 

45 herein by reference) . 

l % 1 5 [0009] Generally, the loading module is responsible for binding the first building 

block used to synthesize the polyketide and transferring it to the first extender module. 

|3 

3* The loading module of DEBS consists of an acyltransferase (AT) domain and an acyl 

carrier protein (ACP) domain. Another type of loading module utilizes an inactivated 
KS, an AT, and an ACP. This inactivated KS is in some instances called KS Q , where the 

20 superscript letter is the abbreviation for the amino acid, glutamine, that is present instead 

of the active site cysteine required for ketosynthase activity. In other PKS enzymes, 
including the FK-520 PKS, the loading module incorporates an unusual starter unit and is 
composed of a Co A ligase activity domain. In any event, the loading module recognizes 
a particular acyl-CoA (usually acetyl or propionyl but sometimes butyryl) and transfers it 

25 as a thiol ester to the ACP of the loading module. 

[0010] The AT on each of the extender modules recognizes a particular extender- 
CoA (malonyl or alpha- substituted malonyl, i.e., methylmalonyl, ethylmalonyl, and 
hydroxymalonyl) and transfers it to the ACP of that extender module to form a thioester. 
Each extender module is responsible for accepting a compound from a prior module, 

30 binding a building block, attaching the building block to the compound from the prior 
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module, optionally performing one or more additional functions, and transferring the 
resulting compound to the next module. The transfer into a module is mediated by the 
KS domain which is upstream of the remaining catalytic domains. The additional 
functions are performed by enzymes which comprise a ketoreductase (KR) which reduces 
the carbonyl group generated from the condensation to an alcohol, a dehydratase (DH) 
which converts the alcohol to a double bond, and an enoyl reductase (ER) which reduces 
the double bond to a single bond. These catalytic domains appear to be immediately 
adjacent and not separated by any linking sequences. Collectively, they can be called 
"beta-carbonyl modifying" domains. Thus, a particular module may contain none of 
these activities, only KR, or KR+DH, or KR+ DH+ER. Thus, the order of domains from 
the N-terminus of a particular module is KS, AT, beta-carbonyl modifying domains (if 
present), ACP. The order, C of the beta-carbonyl modifying enzymes is DH ER KR. 
[0011] Thus, each extender module of a modular PKS contains zero, one, two, or 
three enzymes that modify the beta-carbon of the growing polyketide chain downstream 
of the AT catalytic domain. A typical (non-loading) minimal Type I PKS extender 
module is exemplified by extender module 3 of DEBS, which contains only a KS 
domain, an AT domain, and an ACP domain. The next extender module, module 4, 
contains all three beta-carbonyl modifying enzymes. (The beta-carbonyl modifying 
enzymes effect such modification on the extender unit that has been added by the 
previous module.) 

[0012] Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of the 
loading module migrates to form a thiol ester (trans-esterification) at the KS of the first 
extender module; at this stage, extender module one possesses an acyl-KS adjacent to a 
malonyl (or substituted malonyl) ACP. The acyl group derived from the loading module 
is then covalently attached to the alpha-carbon of the malonyl group to form a carbon- 
carbon bond, driven by concomitant decarboxylation, and generating a new acyl- ACP 
that has a backbone two carbons longer than the loading building block (elongation or 
extension). 

[0013] After traversing the final extender module, the polyketide encounters a 
releasing domain that cleaves the polyketide from the PKS and typically cyclizes the 
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polyketide. For example, final synthesis of 6-dEB is regulated by a TE domain located at 
the end of extender module six. In the synthesis of 6-dEB, the TE domain catalyzes 
cyclization of the macrolide ring by formation of an ester linkage. In FK-506, FK-520, 
rapamycin, and similar polyketides, the ester linkage formed by the TE activity is 
5 replaced by a linkage formed by incorporation of a pipecolate acid residue. The 

enzymatic activity that catalyzes this incorporation for the rapamycin enzyme is known 
as RapP, encoded by the rapP gene. The polyketide can be modified further by tailoring 
enzymes; these enzymes add carbohydrate groups or methyl groups, or make other 
modifications, i.e., oxidation or reduction, on the polyketide core molecule. For example, 

J: 10 6-dEB is hydroxylated at C6 and C 12 and glycosylated at C3 and C5 in the synthesis of 

C3 erythromycin A. 

ipS [0014] In PKS polypeptides, the regions that encode enzymatic activities (domains) 

5 2 are separated by linker or "scaffold"-encoding regions. These scaffold regions encode 

45 amino acid sequences that space the domains at the appropriate distances and in the 

■s. 

f ~. 1 5 correct order. Thus, the linker regions of a PKS protein collectively can be considered to 

encode a scaffold into which the various domains (and thus modules) are placed in a 
42 particular order and spatial arrangement. Generally, this organization permits PKS 

5t catalytic domains of different or identical substrate specificities to be substituted (usually 

at the DNA level) between PKS enzymes by various available methodologies. Thus, 
20 there is considerable flexibility in the design of new PKS enzymes with the result that 

known polyketides can be produced more effectively, and novel polyketides useful as 

pharmaceuticals or for other purposes can be made. 

[0015] PCT publication WO 98/493 1 5, the contents of which are incorporated herein 
by reference, describes an approach for modifying the enzymatic activities included 

25 within modules of a PKS by maintaining the scaffolding intact but replacing catalytic 

domains with different catalytic domains. U.S. Serial No. 09/346,860 filed 2 July 1999 
and the corresponding PCT publication WO 00/01838, also filed on that date, and 
incorporated herein by reference describe alternative methods by altering the 
hypervariable region of the AT domains so as to alter the specificity for an extender unit 

30 and alteration of the KS domains to control stereochemistry. The present invention takes 
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advantage of the approach of manipulating modules so that the catalytic activities of an 
entire module are placed in the appropriate sequence to construct a desired polyketide. 
The ability to utilize this approach depends on effecting an appropriate means for the 
module to incorporate a growing polyketide chain, which involves assuring that an 
appropriate linker region is included. Since the filing of the provisional application from 
which the present application claims priority, a related paper has been published by 
Ranganathan, A., et ah, Chem. & Biol (1999) 6:731-741. In this paper, intrapolypeptide 
linkages are fortuitously supplied to chimeric modules by including the KS region of the 
native downstream module in a chimera between the corresponding upstream module and 
the portions downstream of the KS domain in a heterologous module. Alternatively, the 
downstream module will include the ACP catalytic domain of the native upstream 
module fused to the remainder of a heterologous module upstream in the chimera. 

Background Information 

[0016] The following articles provide information relating to the invention: Aparicio, 
J. F., et aU (1996) Gene 169, 9-16; Cortes, J., et al, (1990) Nature 348, 176-178; 
Donadio, S., et al, (1991) Science 252, 675-679; Gokhale, R. S., et al., (2000) Curr. 
Opin. Chem. Biol 4, 22-27. 

Abbreviations 

[0017] 6-dEB: 6-deoxyerythronolide B; ACP: acyl carrier protein; AT: 
acyltransferase; DEBS: 6-deoxyerythronolide B synthase; DH: dehydratase; ER: 
enoylreductase; KR: ketoreductase; KS: ketosynthase; NAC: N-acetylcysteamine; 
NRPS: nonribosomal peptide synthetase; PCP: peptidyl carrier protein; PKS: polyketide 
synthase; ACP: acyl carrier protein; ER: enoylreductase; LDD: loading didomain; TE: 
thioesterase; M2: module 2 of DEBS; M2(4): module 2 with C-terminal linker from 
module 4; M3+TE: module 3 fused to thioesterase; (5)M3+TE: module 3 with N- 
terminal linker from module 5; M2:M3: complex of module 2 and module 3; and NDK: 
(2S,3R)-2-methyl-3-hydroxypentanoic acid diketide. 
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Disclosure of the Invention 

[0018] The invention is directed to an efficient method for constructing an arbitrarily 
chosen polyketide synthase, and therefore a desired polyketide, by manipulating entire 
modules of Type I polyketide synthases. The invention enables this approach by 
providing the modules with the appropriate "lead-in" or linker sequence to the 
ketosynthase (KS). Applicants have discovered that the appropriate linker between 
modules is required upstream of the relevant KS in order to permit the module to accept 
the nascent polyketide chain, and, in the case of intermolecular transfer, appropriate 
pairing of N-terminal and C-terminal regions assures the appropriate transfer. The nature 
of this linker varies depending on whether the module is covalently linked downstream 
from another module, or whether it forms the N-terminus of the polypeptide. 
[0019] Thus, in one aspect, the invention is directed to a method to construct a 
functional polyketide synthase which method comprises providing each module 
contained in the desired polyketide synthase with an appropriate intrapolypeptide linker 
(RAL) when said module is downstream in the same polypeptide from a module derived 
from a different PKS and with an appropriate interpolypeptide linker (ERL) when the 
module is derived from a PKS where the module is the N-terminal module of a 
polypeptide. If the module at the N-terminus of a polypeptide is to accept a nascent 
polyketide chain from an upstream module, the interpolypeptide linker needs to include 
the appropriate amino acid sequence at the C-terminus of the module donating the 
nascent chain. 

[0020] In describing a "module" being provided with linker(s) the term "module" 
refers to the functional portions extending approximately from the N-terminus of the KS 
catalytic region to the C-terminus of the ACP - i.e., excludes the linker portions otherwise 
considered a portion of the module. 

[0021] As further described below, any order of modules of desired specificity can be 
assured by providing the appropriate linkers either intermolecularly or intramolecularly. 
Thus, the polyketide synthase can be assembled from individual modules by providing 
the appropriate linkers to assure that the polyketide chain will be passed in the correct 
sequence from one module to the next and by assembling these modules either by directly 
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providing the polypeptides containing them or by co-expressing nucleotide sequences and 
coding them in a host cell. 

[0022] In other aspects, the invention is directed to materials and compositions useful 
in carrying out the method, in particular to isolated DNA fragments which contain the 
appropriate intrapolypeptide and interpolypeptide linkers. The invention also relates to 
methods to construct functional polyketide synthases from libraries of modules and to 
polyketides prepared by supplying appropriate substrates to reconstructed polyketide 
synthases. The polyketides thus prepared can be "tailored" using either isolated enzymes 
or feeding the polyketides to an organism containing these enzymes to convert them to 
anti-infectives or compounds of other activities such as motolides by such post- 
polyketide modifications as hydroxylation and glycosylation. The ketides or ketolides or 
their modified forms can also be further derivatized using chemical synthetic methods. 
[0023] In other apects, the invention is directed towards the C- and N-terminal ends 
of adjacent PKS polypeptides capped by peptides of 20-40 residues. Mismatched 
sequences abolish intermodular chain transfer without affecting the activity of individual 
modules, whereas matched sequences can facilitate the channeling of intermediates 
between ordinarily non-consecutive modules. 

[0024] In yet another aspect, the invention is directed towards the role of protein- 
protein interactions in substrate channeling and more specifically to assays or methods to 
assess the steady-state kinetic parameters of individual DEBS modules when primed in a 
channeling modes versus a diffusive mode. The diffusive process precludes the 
involvement of the covalent, substrate channeling mechanism by which enzyme-bound 
intermediates are directly transferred from one module to the next in a multi-modular 
PKS. These methods can be used to quantify the kinetic benefit of linker-mediated 
substrate channeling in a modular PKS. 

[0025] In another aspect, the invention is directed towards the ability of a synthetic 
peptide to inhibit tetraketide production. For example, a peptide corresponding to the N- 
terminal linker of module 3 was synthesized and shown to inhibit the formation of 
tetraketide lactone 2 (as shown in Figure 6) in the presence of M2 and M3+TE in a 
concentration-dependent manner. 
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Brief Description of the Drawings 

[0026] Figure 1 is a diagram of erythromycin PKS which forms 6-dEB. 
[0027] Figure 2 shows the conversion of a diketide thioester to the triketides 
corresponding to those produced by DEBS-1 of the erythromycin PKS. 
[0028] Figure 3 shows sequences of intrapolypeptide and interpolypetide linkers 
derived from various Type I PKS. 

[0029] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. 

[0030] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 
Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. 
[0031] Figure 6 is a schematic diagram of the interpolypeptide transfer with matched 
and mismatched linker pairs. 

[0032] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) M2(4) 
with (5)M3 + TE. 

[0033] Figure 8 shows selective inhibition by a synthetic peptide mimic of the N- 
terminal linker of M3. 

[0034] Figure 9 provides the CD spectrum of a peptide mimic showing the minima at 
208 and 222 nm indicative of GJ-helical character. 

[0035] Figure 10 presents schematic illustrations of three mechanisms of loading a 
DEBS module with a diketide 

[0036] Figure 1 1 illustrates four diketides and their corresponding, putative 
enzymatic products. 

[0037] Figure 12 presents the reaction schemes of the three bimodular DEBS 
derivatives-Ml+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
(DEBSl+TE)-and their corresponding kc at values. 

[0038] Figure 13 presents a comparison of the kc at /KM values (min" mM" 1 ) of the 
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs when presented 
as NAC-thioesters (2a and 2b). 
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[0039] Figure 14 presents a comparison of the kc at values (min" 1 ) of the four diketides 
when presented as acyl-ACP substrates (4a-d) vs when presented as NAC-thioesters (2a- 
d)- 

[0040] Figure 1 5 presents conditions, results, and proposed mechanisms of back- 
transfer experiments. 

[0041] Figure 16 is a schematic of 6-deoxyerythronolide B synthase. 
[0042] Figure 17 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. 

[0043] Figure 18 is a schematic illustrating the channeling of intermediates to 
unnatural recipient modules. 

[0044] Figure 19 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. 

[0045] Figure 20 is table summarizing the substrates and enzymes used to study 
substrate transfer from a donor to a recipient module. 

[0046] Figure 21 shows data supporting a coiled-coil conformation for linkers. 
[0047] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. 

Detailed Description of the Drawings 

[0048] Figure 1 is a diagram of the erythromycin PKS which forms 6-dEB, the core 
precursor of erythromycin. As shown, the PKS is comprised of three proteins, DEBS-1, 
DEBS-2 and DEBS-3 which are encoded by three genes, commonly called eryAI, eryAII 
and eryAIIL 

[0049] Figure 2 shows the conversion of a diketide thioester to the triketides 

corresponding to those produced by DEBS-1 of the erythromycin PKS. 

[0050] Figure 3 shows the structures of intrapolypeptide linkers and the N-terminal 

portions of interpolypeptide linkers (SEQ ID NOS:3-19) derived from various Type I 

PKS. 

[0051] Figure 4 is a schematic diagram of the biosynthesis of 6-dEB by 6- 
deoxyerythronolide B synthase. Each polypeptide, DEBS1, DEBS2, and DEBS3, 
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contains two modules, and each module comprises a set of active site domains 
responsible for addition and modification of an extender unit. The short "linker" regions 
are located at the N- and C-termini; their shapes exemplify the complementarity 
demonstrated by each pair. 
5 [0052] Figure 5 presents a velocity vs. NDK (mM) plot showing the fit of the 

Michaelis-Menten equation for M3 + TE and (5)M3 + TE alone. The nearly identical kc at 
values of 0.71 and 0.68 min" 1 and K M values of 2.8 and 2.5pM demonstrate the 
interchangeability of the linkers for individual modules. 

[0053] Figure 6 presents a schematic diagram of the interpolypeptide transfer with 
1^ 10 matched and mismatched linker pairs: (A) wild-type linker pair, M2 C-terminus + M3 N- 

y terminus; (B) mismatched pair, M2 C-terminus + M5 N-terminus; (C) mismatched pair, 

:; e 

iJ2 M4 C-terminus + M3 N-terminus; (D) matched pair, M4 C-terminus + M5 N-terminus. 

jy [0054] Figure 7 provides saturation curves of (A) M2 with M3 + TE and (B) M2(4) 

^ with (5)M3 + TE showing the effect of increasing M3 concentration on the overall rate of 

% 1 5 turnover. From these plots, the saturating rates of 0.27 and 0.74 min 1 were determined, 

fTf as were the Kd values of 1 . 1 and 2.1 . 

[0055] Figure 8 shows selective inhibition by the synthetic peptide mimic of the N- 
Q terminal linker of M3. The linker lowered the overall rate of tetraketide production in a 

dose-dependent fashion for the transfer using the M2-M3 linker pair. However, it 
20 demonstrated no such effect when tetraketide production depended upon interpolypeptide 

transfer using the M4-M5 linker pair. 

[0056] Figure 9 provides the CD spectrum of the peptide mimic showing the minima 
at 208 and 222 nm indicative of a-helical character. Though only ca. 50% helical, its 
structural features correlate with the expected coiled-coil motif. Furthermore, its ability 
25 to selectively inhibit interpolypeptide transfer verified some recognition ability of the 

mimic. 

[0057] Figure 10 presents schematic illustrations of the three mechanisms of loading 
a DEBS module with a diketide. (A) In a diffusive mechanism, diketides that have been 
activated as N-acetylcysteamine thioesters (diastereomers 2a-d) are loaded exogenously 
30 onto the KS domain. Claisen-like condensation with a C3-unit derived from 
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methylmalonyl CoA followed by NADPH-dependent reduction gives the corresponding 
triketide lactone products (3a-d). (B) In an intrapolypeptide channeling mechanism, a 
diketide that is generated by module 1 from propionyl CoA, methylmalonyl CoA, and 
NADPH is passed intramolecularly from ACPI to KS2. Subsequent elongation and 
reduction afford the triketide lactone 3a. (C) In an interpolypeptide channeling 
mechanism, diketides that have been chemoenzymatically attached to ACP4 by Sfp (4a- 
d) are transferred to the KS domain on a separate polypeptide. Elongation and reduction 
afford the corresponding triketide lactones (3a-d). In all cases suffix "a" refers to the 
(2S,3R) diastereomer, suffix "b" refers to the (2R,3S) diastereomer, suffix "c" refers to 
the (2S,3S) diastereomer, and suffix "d" refers to the (2i?,3i?) diastereomer. See also 
Figure 11. 

[0058] Figure 1 1 illustrates the four diketides and their corresponding, putative 
enzymatic products. See also caption to FigurelO. 

[0059] Figure 12 presents the reaction schemes of the three bimodular DEBS 
derivatives-Ml+M5+TE (module 1 + module 5 + TE), M1+M6+TE, and M1+M2+TE 
(DEBSl-KTE)-and their corresponding kc at values. 

[0060] Figure 13 presents a comparison of the kc a t/K M values (min -1 mM" 1 ) of the 
two syn-diketides when presented as acyl-ACP substrates (4a and 4b) vs when presented 
as NAC-thioesters (2a and 2b). The kcat/K M values for the NAC-thioesters were reporter 
earlier by Wu, N., et ah, J. Am. Chem. Soc, 2000, 122, 4847-4852. 
[0061] Figure 14 presents a comparison of the k^t values (min' 1 ) of the four diketides 
when presented as acyl-ACP substrates (4a-d) vs when presented as NAC-thioesters (2a- 
d). "N.D." denotes that the product was not detected. 

[0062] Figure 15 presents the following: (A) X-ray film image of SDS-PAGE gel and 
associated conditions of back-transfer experiments; and (B) proposed mechanism for 
back transfer of an exogenously loaded diketide from the KS of a formally downstream 
module to an upstream ACP. 

[0063] Figure 16 is a schematic of 6-deoxyerythronolide B synthase. Intermodular 
paris are shown in color. LD: loading domain; AT:acyltransferase; ACP: acyl carrier 
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protein; KS: ketosynthase; KR: ketoreductase; DH: dehydratase; ER: enolyreductase; and 
TE: thioesterase. 

[0064] Figure 17 is a schematic of matched and mismatched linker pairs at 
intermodular interfaces. Figure 17 shows how linker pairs at intermodular interfaces are 
selective, yet interchangeable. The left hand panel shows how matched linker pairs 
(triangular/triangular or square/square) facilitate efficient chain transfer. The right hand 
panel shows how mismatched linker pairs (triangular/square or square/triangular) abolish 
chain transfer without affecting the activity of individual modules. 
[0065] Figure 1 8 is a schematic illustrating the channeling of intermediates to 
unnatural recipient modules. Linker pairs can channel intermediates to unnatural 
recipient modules. The left hand panel shows that in addition to communicating with its 
natural partner, module 3 (as shown in Figure 17), module 2 can also communicate with 
modules 5 or 6 as long as linker pairs are matched. The right hand panel shows that such 
communication is totally abolished if unmatched linker pairs are used. 
[0066] Figure 19 is a schematic illustrating the replacement of the entire donor 
module with just the acyl carrier protein. The properties of linkers as shown in Figures 
17 and 18 are maintained when the entire donor module is replaced with just the acyl 
carrier protein. 

[0067] Figure 20 is table summarizing the substrates and enzymes used to study 
substrate transfer from a donor to a recipient module. Figure 20 shows how matched 
linker pairs are able to efficiently transfer otherwise poor substrates from a donor to a 
recipient module. Three different diastereomers of the natural substrate of module 2 were 
presented to either module 2 or module 5. These substrates were presented as N- 
acetylcysteamine (NAC) thioesters or on an ACP carrying a matched linker. In every 
case chain transfer via protein-protein interactions proved to be a useful method for 
substrate delivery. Products A , B, and C were producted at lower rates than the other 
products. 

[0068] Figure 21 shows data supporting a coiled-coil conformation for linkers. 
Circular dichroism and ultracentriguation on synthetic peptides indicate a coiled-coil 
conformation for linkers. The top panel shows circular dichroism (millidegrees) versus 



sd-81206 v2 



- 13- 



PATENT 

Atty. Dkt: 300622004620 



wavelength (nm) for Mix, M2C, and FullM3N. The bottom left panel shows InA versus 
radius 2 (mm 2 ) for linkers at module 2 C-terminus. The bottom right panel shows InA 
versus radius (mm ) for linkers at module 3 N-terminus. 

[0069] Figure 22 is a schematic illustrating how exogenous peptide mimetics of 
linkers can inhibit chain transfer. This figure suggests that the linkers can adopt a 
functional, native conformation. The top panel shows a natural transfer and the bottom 
panel shows how the addition of an exogenous peptide inhibits the transfer. 

[0070] Other features and advantages of the invention will be apparent from the 
following detailed description, and from the claims. 

Modes of Carrying Out the Invention 

[0071] The invention takes advantage of the identification of the amino acid 
sequences for supplying an appropriate linker between modules of a Type I PKS 
depending on the position of the module in the synthetic scheme for the polyketide. If 
the module is at the N-terminus of the polypeptide in which it resides - i.e., there is no 
additional module covalently bound upstream to it, an "interpolypeptide linker"(ERL) is 
placed upstream of the KS catalytic domain. Conversely, if the module resides in a 
polypeptide wherein there is an additional module upstream of it and covalently linked to 
it as a fusion protein, the two modules should be separated by an "intrapolypeptide 
linker" (RAL). If the module residing at the N-terminus of a polypeptide is downstream 
in the synthesis process for a polyketide - Le. 9 if it must accept a nascent polypeptide 
chain from a different module not on the same molecule, it may be necessary as well to 
supply a portion of the interpolypeptide linker at the C-terminus of the module providing 
the nascent polyketide chain in order to assure orderly transfer. 

[0072] In the discussion that follows, polyketide synthases are discussed either at the 
protein level or the DNA level. As is well understood, manipulation of the sequence of 
amino acids in the polyketide synthase proteins is most conveniently done using 
recombinant techniques. Thus, for example, the appropriate linker sequences can be 
introduced to or modified with respect to those of an existing module by modifying the 
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appropriate gene and expressing it in a suitable host. Interchange of linkers is also 
conveniently done in this manner. Further, modifications of amino acid sequences so as 
to obtain "variants" are effected by mutating the gene. The referent polyketide synthase 
should be understood to exist at both the protein level and nucleic acid level, and which 
form is being discussed should be apparent from the context. 

[0073] Further, the action of polyketide synthases on their appropriate substrates can 
be effected either extracellularly by using isolated enzymes or may be effected by 
producing the enzymes intracellularly. By "appropriate substrate" is meant the extender 
units in their thioester forms that are recognized by the various modules in the PKS and 
"starter" units which are either thioesters of carboxylic acids or partially synthesized 
polyketides such as diketides. For example, as described in PCT application 
PCT/US96/1 1317, the ketosynthase domain of module 1 may conveniently be inactivated 
thus making more efficient the utilization of the diketide by module 2. 
[0074] The linkers can be supplied by conventional recombinant DNA manipulations 
through the use of restriction enzymes and ligation procedures commonly practiced. The 
linkers in the PKS of the invention will be "isolated" from their natural environments. 
By "isolated," as used herein, is meant simply that the referent is found linked in 
association with moieties with which it is not normally associated, or in an environment 
in which it is not naturally found. It may be linked, if a nucleotide sequence to additional 
sequence with which it is not normally linked, or, if a peptide, to additional amino acid 
sequence with which it is not ordinarily linked, or it may be simply detached from 
additional moieties with which it is usually associated. 

[0075] As seen from Figure 3, the intrapeptide linkers (RAL) of the invention contain 
approximately 16-20 amino acid and typically contain a proline residue at approximately 
the middle of the sequence. On the other hand, the N-terminal upstream interpolypeptide 
linkers are approximately twice as long and appear to contain conserved acidic amino 
acid residues and basic amino acid residues at positions in the upstream half of the 
molecule. Thus, typical N-terminal upstream interpolypeptide linker (ERL) will contain 
an acidic amino acid within the first 3-10 residues, which is followed after 8-10 residues 
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by a basic amino acid, and then after another 2-5 amino acid residues by an additional 
acidic amino acid. Additional acidic and basic residues may also occur in these linkers. 
[0076] The intrapeptide linkers or interpeptide linkers shown in Figure 3 can be used 
as described below in the present invention or the corresponding amino acid sequences 
from native Type I PKS in general can be employed. In addition to the sequences that 
occur in nature, "variants" may be used. These variants are obtained by altering the 
amino acid sequence of the linker in minor ways that do not affect the ability of the linker 
to "feed" the nascent polyketide chain to the module in question. Typically, such 
"variants" are obtained from the native sequences by amino acid substitution, deletion or 
insertion; preferably the substitutions are "conservative" substitutions - i.e., an acidic 
amino acid for a different acidic amino acid, a basic amino acid for a different basic 
amino acid, and the like. Preferably, the variants contain no more than three amino acid 
alterations, preferably only two, and more preferably only one. 
[0077] For construction of polyketide synthases which contain more than one 
polypeptide, the appropriate sequence of transfers is assured by matching the appropriate 
C-terminal amino acid sequence of the donating module with the appropriate N-terminal 
amino acid sequence of the interpolypeptide linker of the accepting module. This can 
readily be done, for example, by selecting such pairs as they occur in native PKS. For 
example, two arbitrarily selected modules could be coupled using the C-terminal portion 
of module 4 of DEBS and the N-terminal of portion of the linking sequence for module 5 
of DEBS. 

[0078] In general, the method of the invention involves supplying to a module used in 
a PKS for synthesis of a desired polyketide with the appropriate N-terminal upstream 
portion interpolypeptide linker (N-ERL), C-terminal downstream portion of an 
interpolypeptide linker (C-ERL) or with an intrapolypeptide linker (RAL) at either 
terminus. As stated above, if the module is at the N-terminal portion of a polypeptide, an 
N-terminal upstream interpolypeptide linker should be appended at its N-terminus. If the 
module resides in a polypeptide where there is an additional module fused upstream from 
it, the two modules should be separated by an intrapolypeptide linker. 
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[0079] For ease of construction, a library of functional modules can be maintained to 
provide the appropriate desired module for construction of the PKS. One way to ensure 
the appropriate sequence of polyketide chain growth is to link the modules covalently, so 
that all but the first module will contain upstream intrapolypeptide linkers. Alternatively, 
and preferably, appropriate communication between functional modules non-covalently 
associated on separate polypeptide molecules can be achieved by providing appropriate 
matching between the C-terminal downstream portion of the interpolypeptide linker 
associated with the module contributing the nascent polyketide chain and the N-terminal 
upstream portion of the interpolypeptide linker placed upstream of the module which 
accepts and extends this nascent polyketide. Thus, an appropriate linker to ensure that 
the growing polyketide chain will be passed from module A to module B, which modules 
are not covalently bound, would be to couple, for example, the C-terminal scaffold 
portion of module 4 from erythromycin to module A and the N-terminal interpolypeptide 
linker (scaffold) portion from module 5 of the erythromycin PKS to the N-terminus of the 
KS of module B. 

[0080] To design and construct the PKS, one straightforward approach is to utilize 
the existing linker regions of a native PKS, such as erythromycin PKS, and simply to 
"plug in" modules, for example from a library. 

[0081] A library of modules derived from naturally occurring PKS which contains 
modules incorporating all alternative extender units used in native PKS combined with all 
variants of beta-carbonyl modification is not large. Extender units that are incorporated 
naturally include malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and 
hydroxymalonyl-CoA. The appropriate native molecule for incorporation of each of 
these can readily be found. Methylmalonyl-CoA extender units are incorporated, for 
example, by the modules of the erythromycin PKS. Certain modules of the picromycin 
PKS incorporate malonyl-CoA, while modules of the epothilone PKS incorporate 
ethylmalonyl-CoA or hydroxymalonyl-CoA. Modules occur naturally which contain the 
full spectrum of beta-carbonyl modifying activities; to the extent it is desirable to couple 
a particular beta-carbonyl modifying activity with a particular extender specificity, this 
can be accomplished by altering catalytic domains, per se, as described in the above- 



sd-81206v2 



-17- 



PATENT 

Atty. Diet.: 300622004620 



referenced PCT publication WO 98/493 15. The complete combination of extender unit 
choices with all beta-carbonyl modification choices is thus only a total of 4x4 or 
16 modules. As the KS unit determines the stereoselectivity of the module, 
accommodation can be made for various stereoisomeric forms of precursor by adjusting 
the KS domain in the module library. This expands the total number of modules 
necessary only to 32. An arbitrary number of modules can be included in a particular 
PKS construct, thus also determining the length of the polyketide chain and the size of 
the macrolide product. Of course, the macrolide product can be modified, if desired, by 
the known tailoring enzymes which convert naturally occurring macrolides to 
hydroxylated and/or glycosylated forms and the like. Such modification can be achieved 
in a variety of ways - by chemical modification, by in vitro treatment with appropriate 
enzymes, or by feeding the polyketides to a host organism which contains the appropriate 
tailoring enzymes, as is well understood in the art. 

[0082] To construct the desired PKS, modules are selected from the library and 
provided the appropriate upstream intrapolypeptide or interpolypeptide linkers. Suitable 
linkers can be selected from the group consisting of those shown in Figure 3, the 
corresponding sequences from any Type I PKS, or can include variants of these depicted 
sequences which are conservative in nature and do not interfere with the ability of the 
linker to permit effective uptake of the nascent polypeptide chain. The linkers can be 
added by standard recombinant techniques to the modules in the library, or, the library 
can be composed of the collection of modules wherein each module has been further 
manipulated to include either an intrapolypeptide or interpolypeptide linker. It may be 
desirable, for example, to provide each type of extender module with an intrapolypeptide 
linker, including the possibility of retaining the linker that is normally associated with it. 
If the linker is placed at the N-terminus of the module, the module is suitable for covalent 
linking downstream of an additional module in a single polypeptide. If the 
intrapolypeptide linker is at the C-terminus, ordinarily that module will be placed and 
linked covalently upstream of an additional module. In any case, a module may 
arbitrarily be provided with an intrapolypeptide linker (RAL) at either its N- or 
C-terminus depending on where it is ultimately to be placed in the PKS to be constructed 
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or may be provided with the N-terminal upstream portion of an interpolypeptide linker 
(N-ERL) if it is to be placed at the N-terminus of a polypeptide in the PKS, or with a 
C~terminal portion of an interpolypeptide linker (C-ERL) if it is to be placed at the 
C-terminus of a polypeptide and is intended to transfer a nascent polyketide chain to a 
subsequent module. 

[0083] The various modules, with appropriate linkers are then assembled into the 
desired polyketide synthase. As stated above, the construction of the PKS can be based 
on plugging in active portions of modules into an existing linker array The assembly can 
be performed by simply mixing the peptides containing the modules or may be generated 
recombinantly from expression constructs in a host cell. The cell may provide the 
appropriate substrates for the PKS, or the substrates may need to be provided to the 
reaction mixture containing the polypeptides or to the cells in which they are generated. 
Depending on the choice of host, provision may need to be made for providing these 
substrates. 

[0084] In this way, the modules can be "mixed and matched" as desired to construct a 
polyketide product from the desired extender units and with the desired beta-carbonyl 
modification, choosing the linkers in accordance with the position of the module in a 
polypeptide, and the number of modules cam be altered as desired. 
[0085] A preferred starter unit for such an assembly of modules is a diketide thioester 
either formed in situ by including a module which contains a loading domain to 
incorporate a starter unit along with an extender unit to attain this resultant, or the 
diketide may be synthesized independently and used as the substrate for the PKS. The 
synthesized diketide may be supplied as the thioester, such as the N-acylcysteamine 
thioesters. Preparation methods for these thioesters are described in the above-referenced 
U.S. Serial No. 09/346,860 filed 2 July 1999 and the corresponding PCT application, as 
well as U.S. Serial No. (Atty. docket No. 30062-20032.00) filed 27 January 2000. 
[0086] Using the techniques of the invention, it is thus possible to manipulate entire 
modules and effect efficient cross-talk so as to assure production of the desired 
macrolide. Such techniques can be used, for example, to alter the structure of macrolide 
anti-infectives by, for example, replacing the module 2 of the erythromycin gene cluster 
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with module 2 of the tylosin gene cluster, or replacing the erythromycin module 6 (along 
with its thioester sequence) with the corresponding module 6 from narbomycin. 
[0087] In addition, 14-membered macrolides could be expanded to become 16- 
membered macrolides by fusing modules 2-3 of the tylosin, spiramycin or niddamycin 
modules 2-3 between modules 1 and 3 of the erythromycin synthase or by adding any 
arbitrarily chosen module from other Type I PKS clusters into the synthase for 
production of erythromycin. Alternatively, modules 1-2 of erythromycin could be 
deleted and replaced by modules 1-3 of tylosin, spiramycin or niddamycin. 
[0088] In addition, new substituents can be introduced into, for example, PKS 
erythromycin or its precursors by replacing the second module of the erythromycin PKS 
with module 5 from tylosin PKS where the substituted module has the enoyl reductase 
catalytic activity inactivated. This results in erythromycins substituted with an ethyl 
group at the 10-position. Alternatively, erythromycin module 5 could be replaced by the 
spiramycin module 6 to obtain 5-desmethyl-4-OH erythromycins. 
[0089] Improved forms of FK-506 are obtained by replacing rapamycin modules 2-10 
with FK-506 modules 2-6, or by replacing rapamycin modules 2-1 1 with FK-506 
modules 2-7 or by replacing rapamycin modules 2-12 with FK-506 modules 2-8 or by 
replacing rapamycin modules 1 1-14 with FK-506 modules 7-10. Any combination or 
subset of the above could also be employed. Improved forms of FK-520 can be made in 
a similar manner. An alternative form of rapamycin is synthesized by substituting the 
FK-506/520 module 1 for rapamycin module 1. 

[0090] The foregoing are merely exemplary of the types of manipulations that could 
be employed. The polyketides, obtained by supplying the appropriate substrates either 
in vitro or in vivo, may then be further modified if desired by hydroxylation, 
glycosylation and the like to obtain desired products. Further, chemical synthetic 
manipulations may also be employed. 

[0091] Some of the resulting compounds described above could be prepared by 
alternative techniques previously disclosed, for example, in PCT applications 
PCT/US99/22886 or PCT/US99/24483. However, the procedure described above, which 
manipulates entire modules, may result in better yield or more convenient synthesis. 
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[0092] In addition to housing six modules, the three polypeptides of DEBS each 
possess short, nonconserved segments of amino acid residues located at the N- and C- 
termini of adjacent polypeptides (shown with complementary symbols in Figure 4 and 
5 described in Example 6 below). A previous study has discussed the importance of 

keeping these interpolypeptide "linkers" intact during the engineering of chimeric PKSs 
(Gokhale, R.S., et al. (1999) Science 284, 482-485. To dissect the precise role of these 
linkers in mediating intermodular interactions, an in vitro system consisting of a donor 
module and an acceptor module was developed and kinetically characterized and is 
y f% 10 described in Example 6 below. Each of the components of this functional linker pair 

™f could then be replaced with counterparts from other naturally occurring linker pairs. The 

yj results of these experiments, shown in Example 6 and accompanying figures 4-9, support 

fl | the understanding that the linker regions outside a module's conserved catalytic domains 

% impact its interactions with other modules. In addition to their importance in functionally 

:sf?». 

* 1 5 connecting modules, these short sequences also play an active role in the protein-protein 

in recognition of modules, helping to maintain the selective transfer of intermediates. 

C3 [0093] There are several strategies for rationally manipulating polyketide structure by 

Q engineering DEBS. For example, it has been demonstrated that DEBS is amenable to the 

M introduction of unnatural side chains at the C13 and Cn positions via precursor-directed 

20 feeding of diketides (Jacobsen, et al., Science 1997, 277, 367-369; Jacobsen, et al., 

Bioorg.Med. Chem. 1998, 6, 1171-1177; Hunziker, et al, Tetrahedron Lett 1999, 40 9 
635-638), as well as via replacement of loading didomains from alternative synthases. 
See Marsden, et al., Science 1998, 279, 199-202). In addition, protein engineering of 
DEBS can generate truncated polyketides, (Kao, et al., Am. Chem. Soc. 1994, 776, 11612; 
25 Cortes, et al, Science 1995, 268, 1487-1489; Kao, C. M., et al., J. Am. Chem. Soc. 1995, 

777, 9105-9106; Kao, et al.,. Am. Chem. Soc. 1996, 778, 9184) epimerized polyketides 
(Bohm, et al., Chem Biol 1998, 5, 407-412; Kao, et al, J. Am. Chem. Soc. 1998, 720, 
2478-2479; Holzbaur, et al, Chem. Biol. 1999, 6, 189-195; Bycroft, et al, Biochem. 
2000, 261, 520-526), desmethyl polyketides (Oliynyk, et al., Chem. Biol 1996, 3, 833- 
30 839; Ruan, et al., 1 Bacterid 1997, 779, 6416-6425; Liu, et al., Am. Chem. Soc. 1997, 
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119, 10553-10554; Lau, et al., Biochemistry 1999, 38, 1643-1651), polyketides 
containing various degrees of modification of the |6-keto groups (Donadio, et al., Science 
1991, 252, 675-679; Donadio, et al., Proc. Natl. Acad. Sci. U.S.A 1993, 90, 7119-7123; 
Bedford, et al., 1996, 3, 827-831; McDaniel, et al., Am. Chem. Soc. 1997, 119, 4309- 
5 4310; Kao, C. M., et al., Am. Chem. Soc. 1997, 119, 1 1339-1 1340), and combinations 

thereof. See McDaniel, et al., Proc. Natl. Acad. Sci. U.S.A 1999, 96, 1846-1851. 
However, one approach for generating diversity in polyketides that has been exploited 
only to a limited extent (Gokhale, et al., Science 1999, 284, 482-485; Ranganathan, et al., 
Chem. Biol. 1999, 6, 731-741) is the fusion of intact modules (or groups thereof) from 
1 0 different PKSs to generate chimeric assembly lines. While the application of such a 

f~t strategy takes advantage of the natural catalytic grouping of the modules to produce 

enzymes of improved catalytic effectiveness, two major issues must be addressed to 

M» rationally implement a modular rearrangement strategy for combinatorial biosynthesis. 

3 First, the molecular recognition features of individual modules need to be deciphered, so 

+*• 1 5 that their placement in hybrid PKSs can be restricted to catalytically productive contexts. 

0 Second, the mechanistic basis for transferring intermediates between adjacent modules 

% must be understood, so that intermodular chain transfer can efficiently occur between 

heterologous modules. This report provides new insights into the relative importance of 

Jy both of these issues and their interrelationships in the context of a multimodular PKS. 

20 [0094] The tolerance and specificity of individual modules of DEBS have been 

indirectly investigated using a variety of genetic, biochemical, and chemical 
approaches. 25 Recently, it has been possible to express and reconstitute individual DEBS 
modules as intact proteins. See Gokhale, et al., Science 1999, 284, 482-485. This 
allowed us to directly assess the substrate specificities of four modules of DEBS 
25 (modules 2, 3, 5, and 6) using a set of N-acetylcysteamine (NAC)-activated diketides as 

potential substrates (2a-d, Figure 10A), Wu, et al., Am. Chem. Soc. 2000, 722, 4847- 
4852. Surprisingly, not only did the substrate specificity profiles of these four individual 
modules turn out to be quite similar, but these profiles also did not correlate well to the 
structures of the natural substrates of individual modules. Separately, recent experiments 
30 have suggested that short intermodular linker sequences play an important role in the 
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selective transfer of polyketide intermediates between modules. See Gokhale, et al, 
Science 1999, 254, 482-485; Tsuji, et al., Biochemistry 2001. Therefore, we considered it 
appropriate to reexamine the steady-state kinetic parameters of individual DEBS 
modules, but this time, to pay closer attention to possible protein-protein interactions that 
could be involved in passing a substrate from an upstream module to its downstream 
neighbor. 

[0095] There are two modes by which a substrate can be passed from one module to 
the next. If the two successive modules are on the same polypeptide (such as modules 1 
and 2 of DEBS), there is an intrapolypeptide chain transfer. On the other hand, if the two 
successive modules are on separate polypeptides (such as modules 2 and 3 of DEBS), 
there is an interpolypeptide chain transfer. In either case, biosynthetic intermediates 
undergo direct interthiol transfer between adjacent modules such that the intermediates 
never go into bulk solution. We refer to this property as the "physical channeling" of 
intermediates between modules. 

[0096] Physical channeling (also commonly referred to as substrate channeling) is 
defined as a mechanism in a sequence of reactions in which reaction intermediate is 
transferred from one active site to the downstream active site without equilibrating with 
the bulk solution. SeeKirsch, et al., Biochemistry 1999, 38, 8032-8037. Physical 
channeling of intermediates can provide kinetic benefits by increasing the effective 
concentration of the substrate, protecting labile intermediates from unproductive 
reactions, and precluding entrance of intermediates into competing enzymatic pathways. 
Furthermore, substrate channeling between two enzymes can help overcome product 
inhibition of the upstream enzyme by tunneling the intermediate out of the upstream 
binding pocket and into the downstream binding pocket more efficiently. 
[0097] While physical channeling is a necessary outcome of fundamental polyketide 
biosynthetic mechanisms (Donadio, et al., Science 1991, 252, 675-679; Cortes, et al., 
Nature 1990, 348, 176-178), the kinetic advantage, if any, of channeling intermediates 
between modules has not yet been resolved. To elucidate the issue of "kinetic 
channeling" (which is defined as physical channeling that results in a kinetic advantage- 
as measured by k cat sover a diffusive loading mechanism in which the intermediate 
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equilibrates in the bulk phase after release from the upstream active site and before 
loading in the downstream active site) in modular PKSs, two new assay systems-one to 
probe intrapolypeptide transfers and one to probe interpolypeptide transfers-were 
devised that would more accurately mimic the transfer of a substrate from the acyl carrier 
protein (ACP) of one module to the ketosynthase (KS) of the next. These assays are 
described in further detail in Example 7 below. In the first assay system, the loading 
didomain and module 1 of DEBS generated in situ the natural diketide intermediate 
((2iS r 9 3i?)-2-methyl-3-hydroxy-pentanoyl-S-ACPi) ? which could then be transferred to 
alternative downstream modules in a bimodular PKS context (Figure 10B). By 
comparing the kinetic parameters of these hybrid bimodular systems to those for 
elongation of the same diketide that has been supplied exogenously to the isolated 
downstream module (Figure 10A), the kinetic benefit of channeling intermediates 
between covalently linked modules could be evaluated. A second assay system was 
developed using a chemoenzymatic method, through which alternative diketides were 
covalently attached to the phosphopantetheine arms of an individually expressed donor 
ACP domain (Figure 10C). Here, the entire diketide-S-ACP adduct (4a-d) is a formal 
substrate for a recipient module, therefore allowing investigation of interpolypeptide 
channeling. (The linker sequence at the C-terminal end of the ACP as previously 
described, see Tsuji, et al., Biochemistry 2001, was included in this construct.) By 
attaching different diketides to the same ACP, the steady-state kinetic parameters for 
diketide elongation by individual modules (each with a TE domain fused to its C 
terminus to facilitate turnover) could be measured. Both assay systems were used to 
compare the properties of modules 2, 5, and 6 of DEBS, three modules that perform the 
same chemistry with identical stereocontrol, albeit on very different substrates (Figure 4). 
The results of these studies are described in Example 7 below. 

[0098] The present invention is further described by the compounds and methods 
described in the following examples. The examples are provided solely to illustrate the 
invention by reference to specific embodiments. These exemplifications, while 
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illustrating certain specific aspects of the invention, do not portray the limitations or 
circumscribe the scope of the disclosed invention. 



sd-81206v2 



-25- 



PATENT 

Atty. Dkt.: 300622004620 



Preparation A 
Construction of Single Module Based Systems 

Single Module Gene Constructs 

[0099] Single module constructs from the DEBS gene cluster were prepared for 
5 modules 2, 3, 5 and 6 as follows. The TE domain is fused to the module to facilitate 

termination. The (M3+TE) gene was prepared from the tri-modular construct pKA031 8 
(McDaniel, R., et al, Chem. Biol (1997) 4:667) having an Nhel site engineered at the 
start of the DEBS-2 gene. Fusion of the TE gene at the end of ACP3 was described in 
connection with the construction of pCK13 in Cortes, J., et al, Science (1995) 268:1487; 
K 10 Kao, CM., et al, J. Am. Chem, Soc. (1995) 117:9105 and Kao, CM., et al, J. Am, Chem. 

O Soc (1996) 1 18:9184, collectively cited below as the "Cortes-Kao documents." The 

y, Nhel-EcoRI fragment was cloned into pET 21c (Novagen) to construct pRSG34. The 

% EcoRI site was used to delete the stop codon of the TE domain so that the protein could 

42 be overproduced as a carboxy terminal (His)6 tagged fusion protein, 

f 1 1 5 [001 00] (M5+TE) was constructed by combining the engineered Ndel site from 

W pJRJIO (Jacobsen, etal, Biochem (1998) 37:4928) with the EcoRI site frompCKIS 

4S (Cortes-Kao documents). The Nde-EcoRI fragment was cloned in pET21c to obtain the 

Jfl expression plasmid pRSG46. Expression constructs for (M2+TE) and (M6+TE) were 

prepared similarly using an engineered Nhe site immediately upstream of the 
20 corresponding KS (at position 7570, 5 -GCTAGCGAGCCGATC-3 ' (SEQ ID NO:l) and 

at position 28710, 5 '-GCTAGCGACCCGATC-3 ' (SEQ ID NO:2)). 
[00101] These constructs were expressed in E. coli BL21 (DE3) along with an 
expression system for sfp phosphopantetheinyl transferase from B. subtilis. The 
co-expression is described by Lambalot, R.H., et al, Chem. Biol (1996) 3:923. For the 
25 construction of the sfp gene, the Ndel-Hindlll fragment derived from the pUC8-.s#? 

(Nakano, et al, Mol Gen. Genet. (1992) 232:313) was cloned into pET28 which has a 
kanamycin resistance gene to give resultant plasmid pRSG56. The resulting proteins 
were then isolated for use in the reaction mixtures described in the Examples below. 
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[00102] In more detail, the expression was induced with 1 mM 
isopropyl-b-D-thiogalactopyranoside, and the cells were harvested by centrifugation after 
1 0 hours and resuspended in disruption buffer, 200 mM sodium phosphate pH 7.2, 
200 mM sodium chloride, 2.5 mM dithiothreitol, 2.5 mM sodium ethylenediamine tetra- 
acetate (EDTA), 1.5 mM benzamidine, 2 mg/L pepstatin and leupeptin and 30% v/v 
glycerol. The cells were lysed by passing through a french press, and the lysate was 
collected after centrifugation. Nucleic acids were precipitated with polyethylenimine 
(0.15%) and removed via centrifugation. The supernatant was made 50% (w/v) saturated 
with ammonium sulfate and precipitated overnight. After centrifugation, the pellet 
containing protein was redissolved in buffer A (100 mM sodium phosphate pH 7.2, 
2.5 mM DTT, 2 mM EDTA and 20% glycerol (v/v)) and stored at -80°C. For 
chromatography, the buffer was exchanged to buffer A + 1 M ammonium sulfate using a 
gel filtration PD10 (Pharmacia) column. The resulting sample was loaded on a Butyl 
Sepharose (Pharmacia) column. Fractions containing DEBS proteins were pooled and 
applied on an anion exchange column (Resource Q; 6 mL, Pharmacia). Purified protein 
fractions were pooled and concentrated using Amicon centriprep30. Typical purified 
protein yields were ~ 3-4 mg/liter of culture. Greater than 90% of proteins were 
phosphopantetheinylated in vivo as a result of the overexpression of sfp 
phosphopantetheinyl transferase. Although the proteins were expressed as (His)6-tagged 
proteins, they did not bind to a Ni-column under experimental conditions. It is unclear 
whether this inability to bind to a Ni-agarose column is due to steric effects or if the 
(His)6 peptide was lost during purification. 

Example 1 

Requirements for Cell-Free Synthesis of Triketides by Individual Modules - 
Identification of Linker Regions 

[00103] A cell-free system, tested for the ability to convert the cysteamine thioester of 
2S,3R-2-methyl-3-hydroxypentanoic acid (compound 2 in Figure 2), consisted of 
0.5-10 mM concentration of the thioester, 2.5 mM 14 C-labeled methylmalonyl CoA and 
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100 pmoles of purified protein prepared in Example 1 in a 100 \i\ reaction. In some 
cases, 1 mM of NADPH was added; this was done in assay mixtures containing 
(M2+TE), (M5+TE) and (M6+TE) proteins. The protein was, in each case, a single 
module of the DEBS PKS fused to the thioesterase (TE) termination region of module 6. 
[00104] The reaction mixtures were quenched and extracted by ethyl acetate and 
separated by thin-layer chromatography (TLC) to discern the formation of the triketide 
ketolactone 3 and triketide lactone 4 (both shown in Figure 2) after 30 minutes. Results 
are shown as the first four entries in Table 1 . 

Table 1 

Formation of Triketides by Single Module Constructs 





Construct 


Plasmid 


Triketide Formed 


1. 


M3+TE 


pRSG34 


Yes 


2. 


M5+TE 


pRSG46 


Yes 


3. 


M2+TE 




No 


4. 


M6+TE 




No 


5. 


ERL-M2+TE 


pRSG64 


Yes 


6. 


ERL-M6+TE 


pRSG54 


Yes 


7. 


M1-RAL-M6-TE 


pST96 


Yes 


8. 


M1-RAL-M3-TE 


pST97 


Yes 


9. 


ery Ml-RAL-rz/M5+TE 


pSTllO 


Yes 


10. 


(ery Ml-RAL n/M5+ERL)+DEBS-2+DEBS-3 


pST113 


Yes (dEB6) 



[00105] As seen in Table 1 ? although the expected triketides were formed from 
(M3+TE) and (M5+TE) (modules which reside at the upstream portion of their respective 
polypeptides), no triketides were formed from (M2+TE) or (M6+TE), (modules which 
reside at the C-terminal portions of their polypeptides). These latter results were 
unexpected since the diketide can be incorporated by module 2 when it is supplied as a 
part of the complete polypeptide DEBS-1 . It was verified that the ACP domain was 
pantetheinylated in modules 2 and 6, and that for (M2+TE), the KS domain could not be 
acylated with radiolabeled diketide. 
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Example 2 

Modification of Single Modules with Linker Sequences 



[001 06] The constructs for (M2+TE) and (M6+TE) were modified by deleting the 
sequences encoding the amino acids upstream of the KS catalytic domain and substituting 
the first 39 amino acids from (M5+TE) containing the N-terminal portion of the 
interpolypeptide linker (N-ERL). The relevant constructs were prepared by replacing the 
BsaBI-EcoRI fragment in pRSG46 by the corresponding fragment from pCK4 to obtain 
(N-ERL-M2+TE), in plasmid pRSG64, or from pJRJIO to obtain (N-ERL-M6+TE) in 
plasmid pRSG54. These constructs yield modules which contain the upstream 39 amino 
acids from module 5. The constructs were expressed in E. coli and proteins obtained as 
described in Preparation A. These proteins were able to produce the triketide product 
from diketide in the cell free system of Example 1, as shown in entries 5 and 6 in Table 1. 
[00107] The various constructs which are successful in converting diketide to triketide 
were then evaluated for the kinetic constants kc a t and Km- These results are shown in 
Table 2. As shown in Table 2, the results are quite similar for all constructs except that 
the results from module 3 show a several-fold decrease in kc at as compared to the other 
modules. This is evidently due to the absence of beta-carbonyl modifying enzymes in 
module 3 as verified by the fact that removal of NADPH, (which is required for the 
activity of such modules) from the reaction mixture of (N-ERL-M6+TE) also results in a 
lowering of the k cat . 



Table 2 



Proteins 



k cat X100 (mm 1 ) 



K M (mM) 



(N-ERL-M2+TE) 



8 ±0.6 



4.6 ± 0.4 



(M3+TE) 



1.5 ±0.3 



4.4 ±0.4 



(M5+TE) 



7.5 ± 0.7 



4.7 ±0.4 



(N-ERL-M6±TE) 



9.5 ±0.6 



4.3 ± 0.4 



(N-ERL-M6+TE) 



4.5 ± 0.7 



4.1 ±0.4 
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(- NADPH) 

[00108] It is apparent from these results that the presence of the N-terminal upstream 
sequence associated with modules located at the N-terminal portion of the polypeptide is 
essential for permitting a module in this position to incorporate the growing polyketide 
chain. 

Example 3 

Construction of (M1-RAL-M3+TE) and (M1-RAL-M6+TE) 

[00109] The BsaBI-EcoRI fragments containing modules 3 and 6 respectively were 
cloned behind the Ml module which contains the intrapolypeptide linker (RAL) that 
natively resides between Ml and M2. The resulting M1-RAL-M3+TE and 
M1-RAL-M6+TE genes were then excised as PacLEcoRI fragments and inserted into 
pCK12 resulting in plasmids pST97 and pST96 respectively. The corresponding proteins 
were produced by transformation into S. coelicolor CH999. The resulting strains of 
S. coelicolor were able to incorporate the diketide thioester into the triketide as shown by 
entries 7 and 8 in Table 1. (The triketide produced is the ketolactone 3 in Figure 2.) 

Example 4 

Additional Intrapolypeptide Mediated Transfer 

[00110] A construct wherein the first module of the DEBS PKS cluster (ery), which 
contains the intrapolypeptide linker of the corresponding M1-M2 polypeptide from the 
erythromycin PKS, is fused to the fifth module of the rifamycin PKS (rif) was 
constructed by replacing the natural sequence at 28024 of rif ACP5 (5 '-CGCGAC-3 ') 
with the Spel recognition sequence 5 '-ACTAGT-3 ' . The BsaBI-Spel fragment 
containing rif MS was excised and replaced the corresponding ery Ml -RAL- fragment in 
pCK12 to obtain plasmid pSTl 10. This plasmid, containing ery Ml-RAL-rz^MS+TE 
was transformed into S. coelicolor CH999 and the resulting strain was able to incorporate 
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the diketide into the triketide lactone as shown by entry 9 in Table 1 . The amount is 
comparable to that produced in this strain transformed with DEBS-1+TE. 

Example 5 

Construction of Modules for Intermolecular Transfer 

[00111] The Pacl-Spel fragment of pSTl 10 was inserted into a derivative of pCK7 
(Kao, CM., et al, Science (1994) 265:509) which had an SpelsiXe engineered at the 
beginning of the scaffolding sequence at the carboxy terminus of the polypeptide 
downstream of ACP2. The resulting pSTl 13 construct still contains ery Ml linked to 
n/M5 via the natural intrapolypeptide linker between ery molecules 1 and 2, and also 
now contains rifMS covalently linked to the downstream C-terminal portion of the ERL 
derived from ery M2. Thus, the complete ERL between the polypeptide generated by 
pSTl 13 and the protein generated by a construct which generates DEBS-2 would 
correspond to the native ERL in the ery PKS - i.e., rifMS would be associated with ery 
M3 via the natural interpolypeptide linker between ery molecules 2 and 3. Co- 
transformation into S. coelicolor of pSTl 13 along with constructs that produce DEBS-2 
and DEBS-3 results in the production of 6-dEB, as shown by entry 10 of Table 1. 

Example 6 

Construction of Modules for Interpolypeptide Transfer With Matched and Mismatched 
Linker Pairs (M2 and M3+TR M2 and (5)M3+TR M2(4) and M3+TE, and M2(4) and 

(5)M3+TE) 

[00112] Reagents and Chemicals. DL-[2-me%/~ 14 C]Methylmalonyl-CoA (56.4 
mCi/mmol) was obtained from ARC, Inc. 

[00113] The N-terminal linker of M3 was synthesized by New England Peptide 
(Fitchburg, MA). The peptide sequence was as follows, M3 N-term: H 2 N- 
MTDSEKVAEYLRRATLDLRAARQRIRELESD-amide. 
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[00114] Construction of Plasmids. Plasmid pBP19 contains module 2 of DEBS 
(M2) and is a derivative of pRSG64 (Gokhale, R. S., et al, (1999) Science 284, 482-485), 
where the thioesterase domain was replaced with a Spel-EcoRI fragment containing the 
natural C-terminal linker for module 2 to make pBP19. Plasmid pST179 encodes a 
derivative of M2 containing the C-terminal linker of DEBS module 4 (M4). The C- 
terminal linker of M4 was obtained as a Spel-EcoRI fragment by PCR using the primers 
5 -ACT AGT AGG CTG TTC GCG GCC TCA C-3' and 5 -G GGA ATT CAG GTC 
CTC TCC CCC GC-3' (bold sequences complement DEBS sequence). The PCR 
amplicon was inserted after M2 using the engineered sites, yielding pST179. This 
plasmid, pRSG34, encodes module 3 of DEBS (M3) with its own N-terminal linker and 
with the thioesterase fused to the C-terminus. Its construction has been described 
previously (id.). Plasmid pST132 encodes a derivative of M3 + TE, where the natural N- 
terminal linker of pRSG34 has been replaced with the N-terminal linker of module 5 of 
DEBS (M5). This substitution required the replacement of the Ndel-BsaBl fragment of 
pRSG34 with the corresponding fragment from pJRJIO (Jacobsen, J. R, et al, (1998) 
Biochemistry 37, 4928-4934). All constructs were cloned into pET-21c (Novagen) 
vectors for expression in Escherichia coli. 

[001 1 5] Strain and Culture Conditions. Expression of the desired proteins was 
achieved by transforming the above plasmids into an engineered strain ofE. coli 
BL21(DE3) containing the sfp phosphopantetheinyl transferase gene from Bacillus 
subtilis (Lambalot, R. H., et al, (1996) Chem. Biol 3, 923-936). The sfp gene product 
was required to posttranslationally modify the acyl carrier protein (ACP) domains by 
phoshopantetheinylating the apo-ACP (Gokhale, R. S., et al, (1999) Science 284, 482- 
485). Cells containing the expression plasmids were selected with carbenicillin and used 
to inoculate a 10-20 mL LB medium starter culture grown at 37°C. After 6 h, the cells 
were pelleted and used to inoculate two 2 L flasks containing 1 L of LB medium each. 
The flasks were shaken at 250 rpm at 37°C until the culture optical density at 600 nm 
(OD 600 ) was 0.6. The flasks were placed in a water bath to cool the cells to 22 C (ca. 10 
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min) and then induced with 0.5 mM isopropyl /3-D4hiogalactopyranoside at an OD 6 oo = 
0.8. Flasks were then stirred at 22-24°C for 10-14 h. 

[001 16] Purification of Proteins. After induction, the cells were harvested via 
centrifugation and washed in 50 mM Tris (pH 8) and 1 mM ethylenediaminetetraacetic 
5 acid (EDTA) before being resuspended in disruption buffer [200 mM sodium chloride, 

200 mM sodium phosphate, 2.5 mM dithiothreitol (DTT), 2.5 mM EDTA, 1.5 mM 
benzamidine, pepstatin and leupeptin (2 mg/L), and 30% (w/v) glycerol]. The cell 
suspension was lysed at 1250 psi using a French press and then centrifuged. 
Polyethylenimine was added to the supernatant to 0.15% to precipitate nucleic acids. 
O 10 Following the centrifugation (20 min at 33300g) to remove the nucleic acids, ammonium 

£jf sulfate was added to the supernatant until a 50% (w/v) saturation was achieved and 

fiJ allowed to precipitate for 2-3 h. The pellet following a 45 min centrifugation (33300g) 

j; was resuspended in buffer A [100 mM sodium phosphate (pH 7.2), 2 mM DTT, 1 mM 

* m EDTA, and 20% (v/v) glycerol]. The resulting suspension was applied in 2.5 mL 

ly 15 aliquots to a 9.1 mL gel filtration column (PD-10, Pharmacia) equilibrated with buffer B 

4; (buffer A+1M ammonium sulfate) and eluted in 3.5 mL of buffer B. This eluant was 

applied to a 30 mL hydrophobic-interaction column (Butyl-Sepharose 4 FastFlow, 
Pharmacia) at 1 mL/min. Elution was performed at 1 mL/min with stepwise changes in 
buffer starting from 100% buffer B, to 40%, 20%, and 0%. Steps were made when the 
20 absorbance at 280 nm approached baseline. Fractions were 10 mL, and those containing 

the protein of interest (typically eluted with 0% buffer B) were pooled and applied to an 
anion-exchange column (Resource Q, 6 mL, Pharmacia) at 1 mL/min. A gradient of 0- 
0.15 M NaCl in buffer A was run at 1 mL/min for 3 column volumes, followed by a 
gentle gradient of 0.15-0.30 M NaCl at 1 mL/min for 10 column volumes. Fractions of 2 
25 mL were collected, and those containing concentrated protein (typically 0.22-0.25 M 

NaCl) were pooled and further concentrated on Centriprep 50 membranes (50 kDa 
molecular mass cutoff; Amicon) to a concentration of 0.1-4 mg/mL Protein 
concentrations were measured via the modified Lowry assay (Sigma) and densitometric 
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analysis of SDS-PAGE gels stained with Coomassie Blue. On the basis of the 
densitometry data, all proteins were determined to be >90% pure. 

[00117] In Vitro Polyketide Production. Assays of individual modules contained 
1 .0/iM protein, 1-10 mM N-acetylcysteamine thioester of the "natural" (2S,3i?>2-methyl- 
3-hydroxypentanoic acid diketide (NDK) (1), 4 mM NADPH, 440 mM sodium 
phosphate, 1 mM EDTA, 2.5 mM dithiothreitol (DTT), and 20% w/v glycerol, pH 7.2, in 
80/iL. Reactions with M2 or M2(4) included 0.3 mM 14 C-methylmalonyl-CoA (specific 
activity adjusted to 10.4 mCi/mmol), and those with M3 + TE or (5)M3 + TE included 
0.5 mM 14 C-methylmalonyl-CoA (specific activity reduced to 1.1 mCi/mmol). [M2(4) 
refers to a derivative of M2 in which the C-terminal linker has been replaced with its 
counterpart from module 4, whereas (5)M3 refers to a derivative of M3 in which the N- 
terminal linker has been replaced with its counterpart from module 5 (see Figure 4).] 
Both concentrations of methylmalonyl-CoA were saturating, but different concentrations 
were needed due to the disparate rates of turnover between the modules with and without 
the thioesterase. The reaction mixtures were preincubated to 30°C and started with the 
addition of the methylmalonyl-CoA. The reactions were incubated at 30°C, and at three 
to four time points, 20/^L aliquots were removed and quenched with ethyl acetate, which 
was extracted twice (450/iL total) to isolate the polyketide products. The ethyl acetate 
layers were pooled, dried in vacuo, and applied to an analytical thin-layer 
chromatography (TLC) plate (Si250F; Baker). The TLC plate was developed using 50% 
ethyl acetate in hexanes as the mobile phase (triketide ketolactone 4, R f = 0.5; triketide 
lactone 3, Rf= 0.4) and then visualized by electronic autoradiography (Instantlmager, 
Packard Instruments). Product formation was quantified by comparison with standards of 
the labeled methylmalonyl-CoA. The identity of products 2, 3, and 4 has been 
unambiguously established in earlier in vitro enzymatic studies (Pieper, R., et al 9 (1995) 
J.Am. Chem. Soc. 117, 11373-11374; Pieper, R., et aL, (1997) Biochemistry 36, 1846- 
1851) and was reconfirmed by chromatographic comparison with authentic reference 
samples. 
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[00118] Assays of M2 and M3 + TE contained LOjuM M2 and 0.4-4/xM M3 + TE, 7 
mM NDK, 0.5 mM 14 C-methylmalonyl-CoA (specific activity reduced to 3.4 mCi/mmol), 
4 mM NADPH, 440 mM sodium phosphate, 1 mM EDTA, 2.5 mM DTT, and 20% w/v 
glycerol, pH 7.2, in 70/zL. Assays of M2(4) and (5)M3 + TE were identical, except they 
contained 0.5/xM M2(4), 0.5-5/xM (5)M3 + TE, and 0.4 mM 14 C-methylmalonyl-CoA 
(specific activity reduced to 6.2 mCi/mmol). The concentration of M2(4) was limiting in 
order to facilitate its saturation with (5)M3 + TE. The reactions were prewarmed at 30°C 
and initiated by the addition of the methylmalonyl-CoA. As described above, 20 fiL 
aliquots were removed at various time points and processed. Extracts loaded onto TLC 
plates were separated using either 80% ethyl acetate in hexanes or 5% methanol in 
dichloromethane, both of which allowed identification of the tetraketide lactone 2 and the 
triketide lactones 3 and 4. 

[001 19] Inhibition of Tetraketide Production, The ability of the synthetic peptide to 
inhibit the transfer reaction, and thus the production of tetraketide, was tested under the 
same reaction conditions described for the two module coincubations. The 
concentrations of M2 and M3 + TE were both LOjuM, and the concentrations of M2(4) 
and (5)M3 + TE were 0.5 and 1.0/xM, respectively. The only difference was the addition 
of the peptide at concentrations ranging from 1 to 100/iM to the assay containing M2 and 
M3 + TE [or alternatively M2(4) and (5)M3 + TE]. For greater accuracy, each inhibition 
assay was performed side by side with a control lacking inhibitor. The effect of the 
inhibitor was thus determined by dividing the inhibited rate by the control rate. 

[00120] Kinetic Analysis. For individual modules, the steady-state turnover number 
was determined from the time course of triketide formation, normalized to the 
concentration of protein. The dependence of the rate on substrate concentration was 
measured by varying the concentration of NDK while maintaining saturating levels of 
NADPH and methylmalonyl-CoA. From these data, the £ cat and K M were calculated by 
fitting the normalized v versus [S] plots to the Michaelis-Menten equation. 
[00121] For tetraketide formation, the rate of production of tetraketide was recorded 
for varying concentrations of M3 + TE [or (5)M3 + TE] at a fixed concentration of M2 
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[or M2(4)] and saturating concentrations of substrates. By fitting the rate dependence of 
tetraketide to a saturation curve, the maximal velocity (aw) of tetraketide production was 
determined, and was assumed to represent the case where every M2 homodimer was 
productively associated with an M3 + TE homodimer. Thus, the affinity of this protein- 
protein interaction could be calculated from the rate of tetraketide formation, as 
represented in Equation 1 

([M2], [M3], and [M2-M3] refer to the concentrations of M2, unbound M3 + TE, and the 
M2/M3 + TE complex, respectively). Since [M2-M3] is related to the K D of M2 and M3 
+ TE as shown in Equation 2, 

K - IM2MM3) 

which can be rearranged to yield Equation 3, 

[MZMM3] 

where [M2]o = total concentration of M2, the velocity of tetraketide production can be 
defined relative to the 

where v = & C at[M2-M3] and v max = £ cat [M2]o. Thus, fitting of the v versus [M3] plot 
(which is equivalent to the bound M3 + TE versus free M3 + TE plot used for Scatchard 
analysis) to Equation 4 allowed determination of the K D for M2 and M3 + TE association. 

[00122] CD Spectroscopy. The CD spectrum of the M3 N-terminal peptide was 
recorded in a 1-mm path-length cell at a sample concentration of IOOjuM in phosphate- 
buffered saline (PBS; 0.15 M KC1, 25 mM phosphate, pH 6.9). Measurements were 
made using an Aviv 62DS spectropolarimeter. Concentration was determined by tyrosine 
absorbance at 275 nm in 8 M guanidine hydrochloride. 
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[00123] Kinetic Analysis of Individual Modules. To directly measure the effect of 
linker replacement, individual modules with substituted linkers were kinetically 
characterized using the natural diketide (NDK) as substrate. The only difference between 
the M3 + TE and (5)M3 + TE proteins was that the former contained the N-terminal 
linker of M3, whereas the latter contained the N-terminal linker of M5. As shown in 
Figure 5, the modules displayed very similar kinetics. The calculated £ cat values were 
0.71 ±0.1 and0.68± 0.1 min" 1 and the £ M values were 2.8 ± 1.5 and 2.5 ± l.OjttM forM3 
+ TE and (5)M3 + TE, respectively. Similar experiments were performed to compare M2 
and M2(4), where the C-terminal linker of M2 had been swapped. Since the rate of 
turnover of NDK to triketide lactone 3 by module 2 lacking the fused thioesterase domain 
was low, however, only an approximate & ca t could be estimated [0.02 and 0.03 min" 1 for 
M2 and M2(4), respectively]. Thus, replacement of either the N-terminal or the C- 
terminal linker of a module does not appear to affect its intrinsic catalytic properties. 

[00124] Kinetic Analysis of M2-M3 Coincubations. Upon coincubation of M2 and 
M3 + TE in the presence of NDK, methylmalonyl-CoA, and NADPH, tetraketide lactone 
2 was formed (Figure 6) with a maximum rate constant of 0.27 ± 0.01 min" 1 . The & cat of 
M2 with M3 + TE was determined from the saturation curve determined using a fixed 
concentration of M2 and a variable concentration of M3 + TE and correlated well with an 
earlier measurement of 0.23 min" 1 for formation of the same tetraketide by DEBS1 and 
M3 + TE (Pieper, R., et aL, (1997) Biochemistry 36, 1846-1851). Thus, the M2 protein 
described here appears to be a viable alternative to DEBS1 as a donor of the triketide 
intermediate to M3 + TE. From the saturation curve, a K D of 1.1 ± 0.1 /iM for the M2-M3 
+ TE interaction could be estimated (Figure 7 A; see also Materials and Methods section). 
Again, this value compares well with the previously reported value of 2.6juM for DEBS1 
and M3 + TE (Gokhale, R. S., et al 9 (1999) Chem. Biol 6, 1 17-125). 
[00125] Analogous to the above study, coincubation of M2(4) with (5)M3 + TE 
allowed examination of the effects of the transplanted DEBS2-DEBS3 linker pair (Figure 
4) on chain transfer between modules 2 and 3. The efficiency of chain transfer in the 
presence of this alternative linker pair (Figure 6D) was established by measuring the 
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same parameters associated with synthesis of the tetraketide lactone 2 [& cat of 0.74 ± 0.06 
min 1 and aK D of 2.1 ± 0.4 (Figure 7B)]. 

[00126] Effects of Mismatched Linker Pairs. In contrast to the above studies with 
M2 and M3 + TE (Figure 6A) or M2(4) and (5)M3 + TE (Figure 6D), both of the 
noncomplementary coincubations [M2 with (5)M3 + TE (Figure 3B) and M2(4) with M3 
+ TE (Figure 6C)] produced the tetraketide lactone 2 at severely compromised rates. The 
highest rate constants that could be measured with either mismatched system were in the 
range of 0.02-0.03 min" 1 . Given these low rate constants, the dependence of rate on 
protein concentration could not be measured. As might be expected, the rate of 
production of ketolactone 4 increased markedly in both cases relative to the matched 
incubations above (data not shown), since module 3 remained active in these mismatched 
coincubations. 

[00127] Inhibition of Tetraketide Production by a Synthetic Peptide. Sequence 
analysis using the CoilScan program (Lupas, A., et al 9 (1991) Science 252, 1162-1164; 
Lupas, A., (1996) Methods Enzymol 266, 513-52) revealed that the N- and C-terminal 
interpolypeptide linkers of DEBS contained 15-20 residue segments with strong 
propensity to assume a coiled-coil structure. Since the N-terminal linker of module 3 is 
relatively short (31 residues), a peptide corresponding to this sequence was synthesized 
(see Materials and Methods). As shown in Figure 8, this linker could inhibit the 
formation of 2 in the presence of M2 and M3 + TE in a concentration-dependent manner. 
To test the specificity of this peptide inhibitor, a similar assay was also conducted in the 
presence of M2(4) and (5)M3 + TE. No inhibitory effect was observed at peptide 
concentrations up to 100juM (Figure 8). Furthermore, the peptide also showed no 
inhibitory effect upon individually assayed modules, including M3 + TE. Thus, isolated 
peptide linkers appear to be selectively capable of competing for their cognate module- 
bound partners without affecting their individual catalytic activities. 
[00128] The N-terminal linker peptide of M3 was analyzed via circular dichroism 
(CD) to assess its a-helical character. As shown in Figure 9, the spectrum shows the 208 
and 222 nm absorbances characteristic of a-helices. The magnitude of these peaks 
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allowed us to estimate that the peptide was approximately 50% helical (Chen, Y. H., et 
al, (1974) Biochemistry 13, 3350-3359). The CD of the peptide appeared invariant with 
peptide concentration, salt concentration, and phosphate concentration. 

[00129] Earlier studies suggested the role of structurally intact intermodular linkers in 
facilitating chain transfer between noncovalently associated modules of PKSs (Gokhale, 
R. S., et al, (1999) Science 284, 482-485). Here we have extended and elaborated these 
findings in several significant ways. First, our results have vividly demonstrated the 
selectivity associated with linker-mediated chain transfer (Figure 6). Although 
intermodular chain transfer can occur between modules possessing mismatched linker 
pairs, a serious kinetic penalty is paid. In contrast, heterologous matched linker pairs can 
facilitate chain transfer between modules as efficiently as natural pairs. Second, the 
selectivity associated with linker pair interactions suggests that they might play a key role 
in the transient assembly of functionally paired complexes of the three DEBS proteins 
(Aparicio, J. F., et al, (1994) J. Biol. Chem. 269, 8524-8528; Pieper, R.,et al, (1995) 
Nature 378, 263-266). In support of this hypothesis, we have demonstrated that a peptide 
mimic of the N-terminal linker of DEBS2 can inhibit chain transfer mediated by the 
linker pair at the DEBS1-DEBS2 interface but not by the linker pair at the DEBS2- 
DEBS3 interface (Figure 8). Third, we show that the N-terminal linker peptide of 
DEBS2 has significant (ca. 50%) helical content (Figure 9). This observation is 
consistent with secondary structure analyses of individual linker sequences, which 
previously indicated a propensity of these sequences to assume coiled-coil 
conformations. 2 It should be emphasized that, in order for a coiled-coil motif to facilitate 
noncovalent docking of two modules, the heterodimer (or heterooligomer) formed by 
associations between the C-terminal linker and N-terminal linker must be 
thermodynamically more favorable than either homodimer. Although the data shown in 
Figure 9 show helical content of a linker, it does not provide evidence for formation. of a 
coiled-coil structure. Thus, direct evidence for the existence of coiled coils at 
intermodular interfaces, as well as their implications for selective chain transfer, remains 
to be obtained. 
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[00130] Earlier studies have demonstrated the importance of two additional molecular 
recognition events in controlling the overall programming and specificity of PKSs. First, 
individual modules can discriminate among alternative incoming substrates (Wu, N., et 
aL, (2000) J. Am. Chem. Soc. 122, 4847-4852); this selectivity appears to reside within 
the individual ketosynthase domains (Jacobsen, J. R. ? et aL, (1997) Science 277, 367-369; 
Chuck, J., et aL, (1997) Chem. Biol. 4, 757-766). Second, ketosynthase and ACP 
domains appear to have some degree of mutual recognition (Dreier, J., et aL, (1999) J. 
Biol Chem. 274, 25108-25112; Ranganathan, A., et aL, (1999) Chem. Biol. 6, 731-741). 
Both of these recognition properties are localized within highly conserved and 
catalytically critical parts of the large PKS modules. Here we define and dissect a third 
element of selectivity. In contrast to the previously recognized factors influencing 
molecular recognition by PKS components, linker-mediated intermodular interactions 
have been localized to short, nonconserved regions that lie outside the core modules and 
have no influence on the intrinsic chemistry of the individual modules. 

Example 7 

Methods Directed Towards Assessing the Effects of Protein-Protein Interactions and 
Enzyme-Substrate Interactions in the Channeling of Intermediates between Polyketide 

Synthase Modules 

[00131] Construction of Plasmids. The gene encoding ACP4(4) was amplified as an 
NdcI-EcoRl PCR fragment (523 bp) using the primers 5'- 
CCATATGGTGGTCGACCGGCTCG-3'and 5'-GAATTCCTA- 
CAGGTCCTCTCCCCC- 3' (sequences complementary to DEBS shown in bold). The 
PCR product was cloned into pET28a (Novagen) to yield plasmid pNW8. Plasmid 
pST157 encodes a bimodular fusion between module 1 of DEBS1 and module 5 of 
DEBS3, with the thioesterase domain fused downstream of module 5 ("M1+M5+TE"). 
This fusion, which was engineered by taking advantage of the natural, conserved BsaBI 
sites located at the start of the KS domains of modules 2 and 5, also includes the loading 
didomain of DEBS 1. The "linker" sequence that covalently bridges the fused modules is 
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the natural sequence between modules 1 and 2, as in DEBS1. The fusion junction 
between module 5 and the thioesterase domain is identical to that in plasmid pRSG46. 
Similarly, plasmid pST92 encodes an "M1+M6+TE" bimodular fusion. Its construction, 
which is completely analogous to that of pST157, involves introduction of this bimodular 
PKS gene from pST96, Gokhale, et aL, Science 1999, 284, 482-485, as an Ndel-EcdRl 
into pET-21c (Novagen). The construction of genes encoding (5)M2+TE, (3)M3+TE, 
(5)M5+TE, and (5)M6+TE (pRSG64, pRSG64, pRSG46, and pRSG54, respectively) 
have been described previously, id., as well as the construction of a gene encoding 
(5)M3+TE (pST132). See Tsuji, et al., Biochemistry 2001. 

[00132] Expression and Purification of Proteins. All individual modules were 
expressed and purified as previously described. Wu, et al., Am. Chem. Soc. 2000, 122, 
4847-4852. The bimodular proteins were expressed as C-terminal His6-tagged fusion 
proteins, and their expression and purification schemes were identical to those previously 
described for the individual modules (id.), yielding 0.2 mg/L culture of purified 
M1+M5+TE and 1 mg/mL culture of purified M1+M6+TE. ACP4(4) was expressed by 
transforming pNW8 into E. coli BL21(DE3) cells (Novagen), which were then grown in 
LB at 37 °C to OD 60 o) 0.7-0.8. BL21(DE3)/pNW8 was induced overnight with 1 mM 
IPTG at 30 °C. The cells were harvested by centrifugation, washed with TE buffer, and 
then resuspended in disruption buffer (100 mM NaH 2 P0 4 (pH 7.2), 100 mM NaCl, 1.2 
mM DTT, 1.2 mM EDTA, 0.7 mM benzamidine, 1 mg/L pepstatin, 1 mg/mL leupeptin, 
and 15% glycerol) before lysis by French press. Following removal of the cell debris by 
centrifugation, the supernatant was treated with 0.1% (w/v) PEI to remove nucleic acids 
followed by a 55% (NH 4 ) 2 S0 4 precipita-tion. The resulting (NH 4 )2S0 4 pellet was 
resuspended in 100 mM NaH 2 P0 4 (pH 7.2), 2.5 mM DTT, 1 mM EDTA, 20% glycerol 
(buffer A). This suspension was desalted on a PD-10 gel filtration column (Amersham 
Pharmacia Biotech AB) equilibrated with 10 mM imidazole in 50 mM Tris (pH 8.0), 1 M 
NaCl, 20% glycerol (buffer B), and the eluant was loaded at 1 mL/min onto a Flex- 
column (Kontes) packed with 5 mL of Ni NTA-Superflow resin (Qiagen) using a 
peristaltic pump. After being washed with 35 mM imidazole in buffer B for ACP4-(4), 
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the His6-tagged protein was eluted from the resin with 90 mM imidazole in buffer B. The 
appropriate fractions were concentrated, and the buffers were exchanged to buffer A + 
1.5 M (NH 4 ) 2 S04 in Centriprep 10 spin columns (Amicon). Using an Akta FLPC system 
(Amersham Pharmacia Biotech AB), the concentrated protein was loaded at 1 mL/min 
5 onto a XK 1 6/20 column packed with 30 mL of Phenyl Sepharose High Performance 

resin and equilibrated with the same buffer. A gradient from 750 mM (NH 4 )2S0 4 to 0 
mM (NH 4 )2S04 in buffer A was applied which eluted the protein at 0 mM (NH 4 ) 2 S0 4 . 
The appropriate fractions were concentrated in Centriprep 10 spin columns to yield 
approximately 10-15 mg/L of purified protein which was flash frozen and stored at -80 

jjj 10 °C. The mass of apo-ACP4(4) was confirmed by MALDI-MS (calculated mass: 20492, 

**f observed mass: 20507). (MW - methionine) was also observed. 

hk 

% [00133] Synthesis of CoA Thioester Diketides. The carboxylic acids of the diketides 

*N were synthesized as previously described. Harris, et al., J. Chem. Res, (S) 1998, 6, 283. 

P 15 They include the (2S,3R), (2R,3S), (2R f 3R), and (2S,3S) diastereomers of 2-methyl-3- 

S{ hydroxy-pentanoic acid. These carboxylic acids were converted to CoA thioesters 5a-d 

*F under the following conditions. See Belshaw, et al., Science 1999, 284, 486-489; 

jy Robertson, et al., J. Am. Chem. Soc 1991, 113, 2722-2729. Carboxylic acid (3.4 mg, 26 

jHinol), CoASH (sodium salt, 1.1 equiv, Sigma), and PyBOP (1.5 equiv, Novabiochem) 
20 were dissolved in 0.39 mL of THF and 0.39 mL of 4% K 2 C0 3 and stirred under argon for 

40 min. The reaction mixture was diluted to up 5 mL with H20 and injected onto a 
Beckman Ultrasphere Cig HPLC column (250 x 10 mm) equilibrated with 50 mM 
NaH 2 P0 4 (pH 4.2) in 10% MeOH/H 2 0. Using a 10 mL/min linear gradient over 30 min 
to 50 mM NaH2P04 (pH 4.2) in 80% MeOH/H 2 0, the CoA thioesters eluted at 55% 
25 MeOH. After removal of the MeOH on a rotavap, the product was desalted by 

reinjection on the same column equilibrated with 10% MeOH/H 2 0 followed by elution 
with 90% MeOH. The product was lyophilized and verified by MALDI-MS (theoretical 
mass: 881.742; observed mass: 882.191) and 5 H NMR (500 MHz) in H 2 0. 5a: 0.71 (s, 
3H), 0.85 (s, 3H), 0.86 (t, 3H), 1.08 (d, 3H), 1.48 (m, 2H), 2.38 (t, 2H), 2.76 (m, 1H), 
30 2.96 (t, 2H), 3.28 (t, 2H), 3.41 (t, 2H), 3.52 (dd, 1H), 3.73 (td, 1H), 3.79 (dd, 1H), 3.98 (s, 
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1H), 4.20 (t, 2H), 4.55 (t, 1H), 4.79 (m, 2H), 6.13 (d, 1H), 8.21 (s, 1H), 8.51 (s, 1H). 5b: 
0.72 (s, 3H), 0.86 (s, 3H), 0.89 (t, 3H), 1.11 (d, 3H), 1.44 (m, 2H), 2.40 (t, 2H), 2.79 (m, 
1H), 2.97 (t, 2H), 3.30 (t, 2H), 3.43 (t, 2H), 3.52 (dd, 1H), 3.75 (td, 1H), 3.80 (dd, 1H), 
3.99 (s, 1H), 4.21 (m, 2H), 4.56 (m, 1H), 4.75 (m, 1H), 4.80 (m, 1H), 6.15 (d, 1H), 8.24 
5 (s, 1H), 8.54 (s, 1H). 5c: 0.66 (s, 3H), 0.78 (s, 3H), 0.78 (t, 3H), 0.97 (d, 3H), 1 .26 (m, 

1H), 1.48 (m, 1H), 2.31 (t, 2H), 2.71 (m, 1H), 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (m, 2H), 
3.45 (dd, 1H), 3.59 (dt, 1H), 3.72 (dd, 1H), 3.90 (s, 1H), 4.14 (m, 2H), 4.49 (m, 1H), 4.73 
(m, 1H), 4.84 (m, 1H), 6.09 (d, 1H), 8.25 (s, 1H), 8.50 (s, 1H). 5d: 0.66 (s, 3H), 0.78 (s, 
3H), 0.78 (t, 3H), 0.97 (d, 3H), 1.27 (m, 1H), 1.48 (m, 1H), 2.31 (t, 2H), 2.71 (m, 1H), 
jf 10 2.89 (m, 2H), 3.22 (m, 2H), 3.34 (t, 2H), 3.46 (dd, 1H), 3.59 (m, 1H), 3.73 (dd, 1H), 3.90 

6 (s, 1H), 4.14 (m, 2H), 4.49 (m, 1H), 4.75 (m, 1H), 4.82 (m, 1H), 6.09 (d, 1H), 8.25 (s, 

U 1H), 8.50 (s, 1H). Concentrations of solutions of CoA thioesters were determined by 

"j? A 2 6o measurement and calibration against known CoA concentration standards. Yield: 

£ 9.6 jumol (37%). 

pl5 

W [00134] Formation of Holo-ACP and Acyl-ACP from Apo-ACP. The 

4= phosphopantetheinylation reactions were catalyzed by the Sfp phos-phopantetheine 

n\ transferase, see Quadri, et al., Biochemistry 1998, 37, 1585-1 595; Weinreb, et al., 

Biochemistry 1998, 31, 1575-1584, under the following conditions: 150 juM apo ACP, 4 
20 equiv CoASH (lithium salt, Sigma) or acyl-CoA 5a-d, 0.2 equiv Sfp in 100 mM 

NaH 2 P0 4 (pH 6.6), 10 mM MgCl 2 , 2.5 mM DTT, 20% glycerol at 37 °C for 20 min. 
Excess small molecules and Sfp were removed from the phosphopantetheinlyated ACPs 
by applying the reaction mixture with an Akta FPLC system to a 6 mLResource Q 
column (Amersham Pharmacia Biotech AB) and eluting with a linear gradient from 0 
25 mM NaCl to 500 mM NaCl in buffer A. The desired proteins eluted at 220 mM NaCl 

and were concentrated using Centriprep 10 spin columns. Protein concentrations were 
determined using a modified Lowry assay (Sigma), and the masses were confirmed by 
MALDI-MS or + ESI-MS (4a: observed mass = 20945, calculated mass = 20944; 4b: 
observed mass = 20964; 4c: observed mass = 21056; 4d: observed mass = 20992). 

30 
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[00135] Qualitative Substrate Incorporation Assays. The reaction buffer for the 
diketide incorporation assays contained 400 mM NaH 2 P0 4 (pH 7.2), 2.5 mM DTT, 1 mM 
ETDA, 20% glyercol (reaction buffer C). 1 fM module, 20 jiM acyl-ACP, 500 /xMdl- 
[2- 14 C]-methylmalonyl CoA (ARC), and 4 mM NADPH (Sigma) were incubated in 20 
fiL of the reaction buffer at 30 °C for 1 .5 h. The reactions were either spotted directly 
onto a TLC plate (Whatman 250 fiM silica gel, UV254), or first extracted with EtOAc 
followed by spotting of the organic extracts on a TLC plate. The TLC plates were 
resolved using 60% EtOAc in hexanes, and the radioactive products were visualized on a 
Packard Instantlmager. 

[00136] Verification of Reaction Products. Triketide lactone products 3a and 3b 
derived from 2a (or 4a) and 2b (or 4b), respectively, have been previously verified.26 To 
verify the triketide lactone products 3c and 3d, reaction extracts were purified by 
preparative TLC. The ethyl acetate extracts of the spots corresponding to the triketide 
lactones were concentrated and then derivatized to TMS ethers by incubation with 50 /xL 
o/N;0-bis-(trimethylsilyl) trifluoroacetamide (Aldrich) for 30 min at room temperature. 
See McPherson, et al., 1 Am. Chem. Soc. 1998, 720, 3267-3268. Injection of the sample 
onto a GC-MS yielded fragmentation peaks at molecular weights 73 and 171, 
corresponding to cleavage between the oxygen and silicon atoms, as expected. Mass 
spectral confirmation data of the j3-ketolactone equivalents of 3c and 3d were obtained 
sans derivatization and by ESI-MS. The elution pattern of the triketide lactones from a 
chiral HPLC column is described below. 

[00137] Determination of k cat Values. The assays for kinetic measurements were 
performed in reaction buffer C and with the same concentrations of NADPH and 14 C- 
methylmalonyl CoA as for the qualitative assays. Saturating concentrations of propionyl- 
CoA were added to and the ACP substrates were excluded from the bimodular reactions. 
To quench the reactions, 20 /xL reaction aliquots were mixed with 80 fiL of 12.5% SDS. 
kcat values for the acyl-ACP substrates were determined by measuring steady-state 
saturating rates at multiple substrate concentrations (varying from 40 to 90 fxM 4). For 
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reactions that did not saturate by 90 iM of substrate, the kcat values are reported as lower 
limits. Workup and visualization of the reaction products were identical to those for the 
qualitative assays. 

[00138] Determination of kcat/K M values. The assays for determination of 
(kc at /KM)rei were performed with two competing substrates in the same reaction under the 
same conditions as described above for the qualitative assays, except the reaction 
volumes were doubled to 40 /xL. The data were fit into the equation where S A and S B are 
the two competing substrates and P A and P B are the corresponding products derived from 
S A and S B , respectively. The unknown, absolute kc at /KM values could then be obtained 
from known, absolute kc a t/K M data that had been derived directly from the initial slopes of 
v versus [S] plots. See McPherson, et al., J. Am. Chetn. Soc. 1998, 120, 3267-3268. Each 
reaction was done in duplicate at two different ratios of substrate concentrations. The 
reactions were quenched with 120 jitL of 12.5% SDS, and the products were extracted 
with 2 x 300 /xL of EtOAc. The organic extracts were purged of highly polar compounds 
as well as particulates by flash chromatography through 50 fiL of silica gel in a 1-mL 
polypropylene pipet attached to a 3-mm, 0.22-/mi nylon syringe filter (Osmonics, Inc.), 
eluting with 1.5 mL of EtOAc. Following removal of the organic solvents, the residual 
extracts were resuspended in 20 /xL of hexane and loaded onto a 250 x 4.6 mm Chiralpak 
AS column with the corresponding guard column (Daicel Chemical Industries) that had 
been equilibrated with 5% EtOH (Reagent Alcohol, Fischer) in hexane. With a flow rate 
of 0.8 mL/min, the products were separated using a 20 min gradient (starting at 2 min) 
from 5 to 15% EtOH in hexane. The reduced triketide lactone products 3a-d eluted at 
20.0, 17.0, 21.5, and 18.5 min, respectively. The unreduced triketide lactone products, 
derived from 4c and 4d, eluted at 21.0 and 19.0 min, respectively. The appropriate 
fractions were collected, and the radio-active products were detected and quantified using 
Formula-989 liquid scintillation cocktail fluid (Packard) on a Beckman LS3801 liquid 
scintillation counter. 
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[00139] Labeling of Holo-ACP4(4) with 14 C-2a Mediated by (5)M2+TE. Holo- 
ACP4(4) (20 jiM) was incubated with 1 mM [l- 14 C]-labeled 2a (custom synthesized by 
Amersham Pharmacia, specific activity 55 mCi/mmol) and 1 jjM (5)M2+TE in reaction 
buffer C for 10 min at 30 °C. The protein was precipitated with 75% acetone/H 2 0 for 5 
min at -80 °C. After washing the pellet with 6.25% (w/v) TCA to remove excess salts 
followed by 500 [iL of 75% acetone/H 2 0 to remove residual, unbound 14 C-2a, the 
precipitated protein was resuspended in 8 /iL of buffer A and 4 [iL of SDS sample buffer, 
and resolved on a 4-20% SDS-PAGE gradient gel (Bio-Rad). The proteins were 
visualized with Coomassie blue stain and dried, and the radioactivity was detected either 
on a Packard Instantlmager or by exposing the gel to X-ray film. 

[00140] Nomenclature. The nomenclature used in this report for proteins containing 
linker regions is similar to that adopted in a previous publication, (id.) Specifically, the 
module of origin of the linker is placed in parentheses either before or after the name of 
the domain or module to which it is attached, depending on whether it is an N- or a C- 
terminal linker, respectively. For example, a fusion protein comprising the ACP domain 
from module 2 of DEBS and the C-terminal linker of module 4 is referred to as ACP2(4); 
likewise, a protein comprising the N-terminal linker of module 5 fused to module 6 is 
referred to as (5)M6. 

[00141] Construction and Expression of Bimodular Enzymes. Analogous to 
DEBS1+TE described earlier, Cortes, et al., Science 1995, 2<58, 1487-1489; Kao, et aL, J. 
Am. Chem. Soc. 1995, 777, 9105-9106, M1+M5+TE (module 1 + module 5 + TE) and 
M1+M6+TE are heterologous fusions of DEBS module 1 with DEBS modules 5 and 6, 
respectively. The natural linker between modules 1 and 2 in the wild-type DEBS1 
protein was preserved in each case. In addition, the DEBS thioesterase (TE) domain was 
fused to the C termini of each downstream module to facilitate turnover by catalyzing the 
release of the triketide product. These two proteins were expressed as C-terminally His6- 
tagged proteins and purified on a hydrophobic butyl sepharose column followed by a 
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Resource Q ion-exchange chromatography to yield approximately 0.2 mg/L culture of 
purified M1+M5+TE and 1 mg/L culture of purified M1+M6+TE. 

[00142] Kinetic Analysis of Bimodular Constructs. In earlier studies on the kinetic 
properties of individual modules (Wu, et aL, Am. Chem. Soc. 2000, 722, 4847-4852), 
substrates were diffusively presented to the KS domain of each module as free N- 
acetylcysteamine (NAC) thioesters. This can be contrasted with the natural mode of 
chain transfer in a multimodular system, where acyl chains arrive at the KS domain via 
direct transfer from an upstream ACP domain (Figure 4). To explore whether the latter 
mode of substrate incorporation can have kinetic benefits over the former, the rates of 
triketide lactone 3a synthesis by M1+M5+TE and MRM6+TE were measured in the 
presence of saturating concentrations of propionyl-CoA, methylmalonyl-CoA, and 
NADPH. The kcat values for these two hybrid PKSs were determined to be 3.1 ± 0. 1 and 
4.1 ± 0.4 min" 1 , respectively (Figure 12). These parameters compare well with the 
maximal rate of 4.8 min' 1 for DEBS1+TE (Pieper, et aL, Biochemistry 1997, 36, 1846- 
1851). In contrast, we have shown earlier that whereas module 2+TE and module 6+TE 
turn 2a over with comparable rate constants (kc at = 4.6 and 17 min -1 , respectively), 
module 5+TE is a significantly weaker catalyst for the same reaction (kcat = 0.25 min" 1 ). 
Addition of exogenous 2a to the reaction catalyzed by the bimodular proteins had no 
effect on their overall catalytic rates. The implications of these results to intramodular 
substrate channeling will be evaluated in the discussion section. 

[00143] Construction and Expression of Individual ACPs. ACP4-(4) includes the 
entire DEBS ACP4 catalytic domain with its natural C-terminal linker. (The ACP linker 
is defined as the residues between the ACP consensus sequence and the C terminus of the 
polypeptide. See Tsuji, et aL, Biochemistry 2001). This gene was expressed as a 20.5 
kDa N-terminally His6-tagged protein to preserve the natural sequence of the C-terminal 
linker. ACP4(4) was purified by affinity chromatography on a nickel column followed 
by a hydrophobic phenyl sepharose column to yield approximately 10-15 mg/L culture of 
purified apoprotein. 
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[00144] Chemoenzymatic Synthesis of Acyl-ACPs. Preparations of the CoA 
thioesters of the natural diketide substrate of module 2, its enantiomer, its C-3 epimer, 
and its C-2 epimer (Figurel 1) were carried out as described in the Materials and Methods 
section. Phosphopantetheinylation of apo-ACP4(4) was cata-lyzed by Sfp (Quadri, et al., 
Biochemistry 1998, 37, 1585-1595; Weinreb, et al., Biochemistry 1998, 57, 1575-1584), 
(Figure 10C, first reaction) to generate acyl-ACP4(4) adducts 4a-d. Purification of these 
acyl-ACPs, as described in the Materials and Methods section, led to >95% pure 
materials as judged from SDS-PAGE. Complete phospho-pantetheinlyation was verified 
by MALDI-MS or ESI-MS. 

[00145] Qualitative Assays of Diketide Incorporation by Acyl-ACPs. The acyl- 
ACP4(4) adducts 4a-d were incubated individually with (5)M2+TE, (5)M5+TE, and 
(5)M6+TE in the presence of saturating concentrations of 14 C-methylmalonyl CoA 
extender unit and NADPH. For a given acyl-ACP, the products from modules 2+TE, 
5+TE, and 6+TE were expected to be identical (Figure 11), since the modules catalyze 
the same set of reactions with identical stereocontrol (albeit normally on very different 
natural substrates). Both 4a and 4b were accepted and extended by all three modules. 
Likewise, the corresponding NAC-thioesters 2a and 2b have been shown to be substrates 
for the three modules. 26 Remarkably, however, 4c and 4d were also substrates for the 
three modules, even though no turnover of the corresponding NAC thioesters 2c and 2d 
was detected in the case of any module, Wu, et al, Am. Chem. Soc. 2000, 122, 4847-4852 
(It should be noted that elongation of 4c and 4d by modules 5+TE and 6+TE yielded . 
minor quantities of unreduced triketide lactones, indicating less efficient /3-ketoreductase 
activity on these two anti-diketide substrates than on the two syn-diketide substrates.). 
Consistent with previous linker studies, 23,27 while all four acyl-ACP adducts were 
observed to be substrates for (5)M3+TE, no product formation was observed from the 
incubation of any of the acyl-ACP adducts with (3)M3+TE, even though 2a and 2b have 
previously been shown to be readily incorporated and elongated when presented to either 
module 3+TE derivative, Wu, et al, Am. Chem. Soc. 2000, 722, 4847-4852; Tsuji, et al, 
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Biochemistry 2001 . Thus, matched linker pairs appear to be capable of enhancing the 
efficiency with which otherwise poor substrates can be channeled between modules. 
Conversely, mismatched linkers can present a major barrier to the channeling of 
otherwise acceptable substrates between modules. Control studies performed with 2a and 
5a showed that the two compounds are approximately equivalent substrates for the same 
modules (data not shown). From the amount of product detected in these ACP-mediated 
reactions, the efficiency of the PKS-catalyzed reaction could be estimated. Under typical 
assay conditions (20pM 4 and 1/xM (5)M5+TE), 70% of the acyl-ACP was converted 
into triketide lactone in 1 h. Two conclusions can be drawn from this result. First, acyl- 
ACPs are significantly superior substrates to acyl-NAC thioesters. (Typically, millimolar 
concentrations of the NAC thioester must be used to detect comparable amounts of 
product under otherwise similar assay conditions.) Second, the assay system described in 
Figure 1 1 allows for monitoring multiple turnovers of the enzyme. Indeed, as described 
below, in all cases the maximum rates of consumption of the acyl-ACP substrates were 
comparable to or higher than the maximum rates of consumption of their NAC thioester 
counterparts (see below). Therefore, the association of the donor ACP and the acceptor 
module must be transient, and the dissociation rate constant of the ACP from the module 
must be significantly faster than the slowest step in the module-catalyzed elongation 
sequence. 

[00146] Kinetic Analysis of Incorporation of Diketides from Acyl-ACPs. The 

kcat/KM values for the reactions of 4a and 4b with (5)M2+TE, (5)M5+TE, and (5)M6+TE 
are shown in Figure 13. Since full saturation curves could not be obtained for the ACP- 
bound substrates for technical reasons, the k^/ECM value for 4b was derived by 
competitive assay against 2a (whose absolute kc^/Ku value was derived from the initial 
slope of the v versus [S] saturation curve). Likewise, the kc a t/K M value for 4a was 
derived by competitive assay against 4b. Several observations are noteworthy regarding 
the data summarized in Figure 13. First, 2a and 4a are significantly better substrates than 
2b and 4b for each module. In addition, the improvement in specificity for an acyl-ACP 
adduct over its NAC thioester counterpart is particularly pronounced in cases where the 
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NAC thioesters are exceptionally poor (e.g., module 2+TE-catalyzed elongation of 4b 
versus 2b, module 5+TE-catalyzed elongation of 4a versus 2a, and especially module 
5+TE-catalyzed elongation of 4b versus 2b). The implications of these observations are 
elaborated in the Discussion section. 
5 [00147] To quantify the kinetic advantage of channeling in the above assay system, the 

kcat values for the reactions of 4a-d with modules 2+TE, 5+TE, and 6+TE were measured 
(Figure 14). Kinetic measurements were performed at substrate concentrations between 
40 and 90/iM of each acyl-ACP substrate. For reactions that yielded measurable 
quantities of unreduced triketide lactone products (i.e., elongation of 4c and 4d by 
jr 1 0 modules 5+TE and 6+TE), both reduced and unreduced products were combined and 

P included for calculations of the kinetics parameters. Except for the reaction of 4a with 

as 

y. module 2+TE, none of the substrates saturate the enzymes in this concentration range, 

*2 and thus, the kc at values are reported as lower bounds. Comparison of the kc at values of 

!¥ p. the acyl-ACP forms of the four diastereomeric diketides with those from the 

L 15 corresponding reactions involving NAC thioesters substrates (which are also shown in 

W Figure 14) (Wu, et al., Am. Chem. Soc. 2000, 122, 4847-4852) affords two interesting 

42 observations. First, in those cases where the NAC thioester is a reasonably good 

SI substrate (e.g., module 2+TE- or module 6+TE-catalyzed elongation of 2a and 4a), the 

maximal reaction rates are comparable regardless of whether the acyl chain is bound to 
20 NAC or an ACP with a matched linker. In contrast, where the NAC thioester is an 

inferior substrate (e.g., module 2+TE-catalyzed elongation of 2b and 4b,M5+TE- 
catalyzed elongation of 2a and 4a, module 5+TE-catalyzed elongation of 2b and 4b, and 
elongation of 4c and 4d by any of the modules), tethering the same acyl chain to an ACP 
with a matched linker can result in significant improvements in kcat- Second, the maximal 
25 rate of turnover of 2a and 4a is significantly greater than that of 2b and 4b for all tested 

modules. Again, the implications of these findings are discussed below. 

[00148] Investigation of the Reversibility of the Donor ACP to Acceptor KS 
Transfer Reaction. Ordinarily, the flow of intermediates in a metabolically active PKS 
30 is vectorial. A possible mechanism for such directionality could be that, once an acceptor 
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KS is acylated with the incoming chain, conformational changes in the module prevent 
the pantetheine arm of the donor ACP from accessing the active site again. To test 
whether this may be the case, holo-ACP4(4) was incubated with I4 C-2a in the presence 
and absence of (5)M2+TE (Figure 15). The ACP was radiolabeled only in the reaction 
containing (5)-M2+TE. (As expected, (5)M2+TE was also labeled.) When 
methylmalonyl Co A extender units and NADPH were added to induce catalytic activity 
of the module, (5)M2+TE dependent labeling of ACP4(4) was also observed, but the 
degree of labeling was considerably reduced. Thus, there does not appear to be an 
absolute barrier to the back-transfer of an acyl chain from a KS to the ACP of the 
preceding module. 

[00149] We have previously investigated the substrate specificity of individual 
modules of DEBS using diketide substrates activated as N-acetylcysteamine (NAC) 
thioesters (Figure 10A) (id.). The diketides included in the previous study were the four 
diastereomeric forms of the natural substrate for module 2 (Figure 1 1). These substrates 
were assayed against DEBS modules 2, 3, 5, and 6~each with a TE domain fused to the 
C terminus to facilitate turnover-and steady-state kinetic parameters were determined for 
each substrate-enzyme combination. The substrate specificity profiles (as reflected in the 
k ca t/K M values) for the four enzymes were found to be remarkably similar in that all of the 
modules preferred 2a over 2b, and, within detection limits, neither of the two anti- 
diketides 2c or 2d was observed to be a substrate for any of the modules. 
[00150] The preference of 2a over its enantiomer 2b for all modules was especially 
intriguing in light of the fact that the natural substrates for modules 3 and 6 share more 
structural similarities to 2b than to 2a. One explanation for this discrepancy was that the 
NAC thioester-based assay system (Figure 10A) may not entirely represent the 
mechanism of acylation of a multimodular system. While NAC thioesters substrates 
must be loaded diffusively onto the KS of a module (Figure 10A), polyketide 
intermediates are channeled from the ACP of one module to the KS of the downstream 
module via covalent transfer. Substrate channeling in a multimodular system can occur 
either between two modules on the same polypeptide (e.g., between modules 1 and 2; 
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Figure 10B), or between two modules on separate polypeptides (e.g., between modules 2 
and 3; Figure 10C). Further evidence that protein-protein interactions may influence the 
substrate specificities of individual modules emerged from previous experiments 
suggesting, that inter-polypeptide linkers-defined as the highly variable regions outside 
the consensus sequences of the modules-are involved in mediating selective 
intermodular chain transfer, Gokhale, et al., Science 1999, 2#4, 482-485; Tsuji, et al., 
Biochemistry 2001 . To investigate the balance of protein-protein interactions and 
enzyme-substrate interactions in controlling polyketide chain elongation, two assay 
systems that take intermodular interactions into account were used in this study. 

[00151] Kinetic Channeling in Intrapolypeptide Chain Transfer. In the first 
system (Figure 10B), the effect of substrate channeling between two modules within the 
same polypeptide was investigated. More specifically, in the context of the bimodular 
constructs M1+M2+TE (DEBS1+TE), M1+M5+TE, and M1+M6+TE, DEBS modules 2, 
5, and 6 were examined for their abilities to accept and elongate the natural diketide 
intermediate that was passed from a covalently attached module 1. The turnover number 
of M1+M5+TE was comparable to that of M1+M6+TE or the "wild-type" M1+M2+TE. 
In contrast, when primed diffusively by 2a, the maximum catalytic rate of module 5+TE 
is significantly reduced compared to that of module 2+TE or module 6+TE. This 
disparity indicates that covalent connection of modules can have a beneficial kinetic 
effect and hinted that, in addition to physical channeling of intermediates, multimodular 
PKSs are also capable of kinetic channeling of intermediates. However, since the only 
incoming substrate that could be tested using this assay system was the natural diketide, a 
better assay system was needed to explore the role of kinetic channeling more generally. 

[00152] Kinetic Channeling in Interpolypeptide Chain Transfer, The minimal 
donor protein requirement for substrate channeling to an acceptor module was postulated 
to be an ACP domain with an appropriate C-terminal linker. Therefore, we constructed, 
expressed, and purified the ACP4 domain and its natural C-terminal linker as an 
individual polypeptide. A variety of acyl groups were then covalently attached to the 
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phosphopantetheine arm of holo-ACP4(4) via a chemoenzymatic procedure (Figure 10C, 
first reaction). The resulting acyl-ACP4(4) adducts 4a-d were tested for their ability to 
transfer the attached diketides from ACP4(4) to the KS of an acceptor module (Figure 
10C, second reaction), where they could then undergo standard chain elongation to yield 
a triketide lactone (Figure 10C, third reaction). The small size of the ACP4(4) protein, 
together with high expression levels of soluble protein in Escherichia coli, allowed 
production of reagent quantities of this protein for use as a substrate in multiple turnover 
assays. The requirement of both an ACP and the linker was highlighted by the fact that 
corresponding CoA thioesters exhibited comparable kinetic parameters and that 
mismatched linkers led to a dramatic reduction in turnover efficiency in the case of 
module 3+TE. The latter feature is consistent with the linker hypothesis developed 
earlier. Id. Although the precise Km values for individual acyl-ACP substrates could not 
be measured in many cases, in all cases they were estimated to be approximately 2-3 
orders of magnitude lower than the reported K M values for the corresponding NAC 
thioester reactions (micromolar for acyl-ACPs versus millimolar for acyl-S-NACs), thus 
making acyl-ACPs excellent substrates for individual PKS modules. 

[00153] Implications of the Interpolypeptide Transfer Kinetics Data, The 

establishment of the acyl-ACP-based assay system allowed us to address two important 
questions regarding the relative balance of protein-protein interactions and enzyme- 
substrate interactions in multimodular systems. First, is the universal preference among 
the three tested modules for 2a over 2b preserved when the same substrates are delivered 
as acyl-ACP adducts? And second, under saturation conditions, can kinetic channeling of 
these diketide substrates be observed for any module? 

[00154] As seen in Figure 13, the preference for the (2S,3i?)-diastereomer over its 
enantiomeric (2i?,3S)-diastereomer by all modules is preserved regardless of whether the 
substrate is loaded by a channeling mechanism or by a diffusive mechanism. This 
conserved preference suggests that the catalytic steps in a given module that discriminate 
between different substrates remain unchanged whether modules are primed diffusively 
or by a channeling mechanism and that for module 2, at least, the most likely source of 



sd-81206 v2 



-53- 



PATENT 

Atty. Dkt.: 300622004620 



discrimination is the KS acylation step. Furthermore, the turnover numbers of the 
individual enzyme-catalyzed reactions reported in Figure 14 make a strong case for 
kinetic channeling in multimodular PKSs. When saturating concentrations of 2b are co- 
incubated with M5+TE in the presence of methylmalonyl-CoA and NADPH, the 
5 elongation rate constant is only 0.02 min" 1 . In contrast, when the same reaction is 

monitored using 4b as a substrate, the elongation rate constant fkcat) increases at least 25- 
fold. Similarly, a maximal rate increase of greater than 40-fold is observed in the 
elongation of 4a versus 2a by M5+TE. The effect may be even more pronounced for the 
two anti-diketides, whose elongation rates are below detectable limits (O.01 min" 1 ) when 
h\ 10 presented as NAC thioesters 2c and 2d, but are quite respectable ( - 1 min" 1 ) when 

presented as acyl-ACP adducts 4c and 4d. Of course, if the KM values for the two anti- 
jU diketides when presented as NAC-thioesters are significantly higher than the solubility- 

% limited concentrations that were used in the assay, then the apparent kinetic advantage of 

channeling the anti-diketides could be artificially high. Even so, these overall results 
Q 1 5 indicate that channeling dramatically increases the efficacy of poor substrates. In 

% addition, the results reported here provide insight into how multimodular PKSs can be so 

■S remarkably tolerant toward protein engineering, even though individual modules are 

fy fairly specific catalysts. 

20 [00155] The Reversibility of ACPn to KSn+1 Transfers. Finally, the ACP-mediated 

strategy for diketide loading onto acceptor modules also enabled us to address the 
question of reversibility of the transacylation reaction between the donor ACP and the 
recipient KS. While co-incubation of 14 C-labeled 2a with holo-ACP4(4) afforded 
essentially no labeling of the ACP, co-incubation of 14 C-labeled 2a with holo-ACP4(4) in 

25 the presence of (5)M2+TE gave both labeled (5)M2+TE and ACP4(4) (Figure 1 5 A). The 

proposed mechanism for the observed labeling is shown in Figure 15B and requires back- 
transfer of the acyl group from the KS to the ACP of the upstream module. Back-transfer 
was also observed under turnover conditions, albeit at a substantially reduced level. 
Thus, the observed directionality of chain transfer in the context of a multimodular PKS 

30 that is rapidly turning over appears to arise due to kinetic channeling of these 
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intermediates rather than a ratchet mechanism that explicitly precludes back-transfer. 
However, given the 20- fold excess of ACP to module, the occupancy level of the diketide 
on the ACP is quite low. Consequently, in a PKS where two modules on separate 
polypeptides exist in approximately equimolar ratios, reverse transfer from a downstream 
module to an upstream module occurs rarely and only at steps where a significant barrier 
for forward chain transfer is encountered. In contrast, for intermodular chain transfer 
between modules within the same polypeptide, the effective molarity of the donor ACP 
group is significantly higher, and reversible transfer may be more significant (but without 
chemical consequence). The likelihood of intrapolypeptide reverse transfer may explain 
why the loading didomain shows relatively low selectivity for a propionyl starter unit 
versus an acetyl starter unit (2:1), see Lau, et al, Biochemistry 2000, 39, 10514- 10520, 
whereas DEBS1+TE discriminates strongly (32:1) between the two starter units. See 
Pieper, et al., Biochemistry 1996, 35, 2054-2060. 

[00156] These studies represent the first direct observation of kinetic channeling of 
intermediates in a modular PKS. Several dramatic examples are presented for both 
intrapolypeptide transfers and interpolypeptide transfers where the maximal rate constant 
(kcat) for elongating a particular ketide substrate by a DEBS module increases 10- to 
>1 00-fold when the substrate is channeled relative to when it is diffusively presented. 
Linkers are shown to play an important role in kinetic channeling, although the 
contribution of other elements, such as the pantetheine arm or protein-protein interactions 
between the donor and recipient modules, cannot be excluded. In addition, our studies 
have also reinforced the fact that, while individual modules are tolerant of stereochemical 
diversity in diketides, they are at the same time fairly specific catalysts. In addition, their 
specificities and recognition features do not necessarily correlate with the structures of 
their natural substrates. Finally, we have shown that the transfer step from a donor ACP 
to an acceptor KS is a fundamentally reversible reaction. Structural and more detailed 
mechanistic studies on these remarkable multifunctional catalysts should be particularly 
interesting from the viewpoint of understanding the atomic basis for the phenomena 
described here. 
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[00157] As used herein, the terms "a", "an" and "any" are each intended to include 
both the singular and plural forms. 

[00158] Numerous modifications may be made to the foregoing systems without 
departing from the basic teachings thereof Although the present invention has been 
described in substantial detail with reference to one or more specific embodiments, those 
of skill in the art will recognize that changes may be made to the embodiments 
specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the specification, drawings, and 
claims. All publications or patent documents cited in this specification are incorporated 
herein by reference as if each such publication or document was specifically and 
individually indicated to be incorporated herein by reference. 
[00159] Citation of the above publications or documents is not intended as an 
admission that any of the foregoing is pertinent prior art, not does it constitute any 
admission as to the contents or date of these publications or documents. 
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